Abstract

We study the role that parallelism plays in time complexity of variants of Winfree's abstract
Tile Assembly Model (aTAM), a model of molecular algorithmic self-assembly. In the
"hierarchical" aTAM, two assemblies, both consisting of multiple tiles, are allowed to aggregate
together, whereas in the "seeded" aTAM, tiles attach one at a time to a growing assembly.
Adleman, Cheng, Goel, and Huang (Running Time and Program Size for Self-Assembled Squares,
STOC 2001) showed how to assemble an n x n square in O(n) time in the seeded aTAM using
O(log n / log log n) unique tile types, where both of these parameters are optimal. They asked whether
the hierarchical aTAM could allow a tile system to use the ability to form large assemblies in
parallel before they attach to break the Ω(n) lower bound for assembly time. We show that
there is a tile system with the optimal O(log n / log log n) tile types that assembles an n x n square using
O(log^2 n) parallel "stages", which is close to the optimal Ω(log n) stages, forming the final n x n
square from four n/2 x n/2 squares, which are themselves recursively formed from n/4 x n/4
squares, etc. However, despite this nearly maximal parallelism, the system requires superlinear
time to assemble the square. We extend the definition of partial order tile systems studied by
Adleman et al. in a natural way to hierarchical assembly and show that no hierarchical partial
order tile system can build any shape with diameter N in less than time Ω(N), demonstrating
that in this case the hierarchical model affords no speedup whatsoever over the seeded model.
We also strengthen the Ω(N) time lower bound for deterministic seeded systems of Adleman et
al. to nondeterministic seeded systems. Finally, we show that for infinitely many n, a tile system
can assemble an n x n' rectangle, with n > n', in time O(n^(4/5) log n), breaking the linear-time
lower bound that applies to all seeded systems and partial order hierarchical systems.



1 Introduction 

Tile self-assembly is an algorithmically rich model of "programmable crystal growth". It is possible
to design molecules (square-like "tiles") with specific binding sites so that, even subject to the
chaotic nature of molecules floating randomly in a well-mixed chemical soup, they are guaranteed
to bind so as to deterministically form a single target shape. This is despite the number of different 
types of tiles possibly being much smaller than the size of the shape and therefore having only 
"local information" to guide their attachment. The ability to control nanoscale structures and 
machines to atomic-level precision will rely crucially on sophisticated self-assembling systems that 
automatically control their own behavior where no top-down externally controlled device could fit. 

A practical implementation of self-assembling molecular tiles was proved experimentally fea- 
sible in 1982 by Seeman [43] using DNA complexes formed from artificially synthesized strands. 
Experimental advances have delivered increasingly reliable assembly of algorithmic DNA tiles with
error rates of 10% per tile in 2004 [39], 1.4% in 2007 [19], 0.13% in 2009 [7], and 0.05% in 2010 [17]. 
Erik Winfree [49] introduced the abstract Tile Assembly Model (aTAM) - based on a constructive 
version of Wang tiling [47,48] - as a simplified mathematical model of self-assembling DNA tiles. 
Winfree demonstrated the computational universality of the aTAM by showing how to simulate 
an arbitrary cellular automaton with a tile assembly system. Building on these connections to 
computability, Rothemund and Winfree [38] investigated a self-assembly resource bound known 
as tile complexity, the minimum number of tile types needed to assemble a shape. They showed 
that for most n, the problem of assembling an n x n square has tile complexity Ω(log n / log log n), and
Adleman, Cheng, Goel, and Huang [3] exhibited a construction showing that this lower bound is 
asymptotically tight. Under natural generalizations of the model [1,6,8,11-14,27,28,32,44,45], tile 
complexity can be reduced for tasks such as square-building and assembly of more general shapes. 

The authors of [3] also investigated assembly time for the assembly of n x n squares in addition to 
tile complexity. They define a plausible model of assembly time based (implicitly) on the standard 
stochastic model of well-mixed chemical kinetics [21-23] and show that under this model, an n x n
square can be assembled in expected time O(n), which is asymptotically optimal, in addition to
having optimal tile complexity O(log n / log log n). Intuitively, the optimality of the O(n) assembly time
for an n x n square results from the following informal description of self-assembly. The standard
"seeded" aTAM stipulates that one tile type is designated as the seed from which growth nucleates,
and all growth occurs by the accretion of a single tile to the assembly containing the seed. The
set of locations on an assembly α where a tile could attach is called the frontier. An assembly
with a frontier of size k could potentially have Θ(k) attachment events occur in parallel in the
next "unit" of time, meaning that a speedup due to parallelism is possible in the seeded aTAM.
The geometry of 2D assembly enforces that any assembly with N points has an "average frontier
size" throughout assembly of size at most O(√N). 1 Therefore, the parallelism of the seeded aTAM
grows at most linearly with time. To create an n x n square of size n^2, the best parallel speedup
that one could hope for would use an "average frontier size" of O(n), which in O(n) "parallel steps"
of time assembles the entire square. This is precisely the achievement of [3].

A variant of the aTAM known as the hierarchical (a.k.a. two-handed, recursive, multiple tile, 
q-tile, aggregation, polyomino) aTAM allows non-seed tiles to aggregate together into an assembly, 
allows this assembly to then aggregate to other assemblies, and possibly (depending on the model) 
dispenses completely with the idea of a seed. Variants of the hierarchical aTAM have recently 
received extensive theoretical study [1,2,4,6,12,13,15,18,31,33,34,51]. It is conceivable that by 
allowing two large assemblies to form in parallel and combine in one step, it may be possible to 
recursively build an n x n square in o(n) time, perhaps even O(log n) or O(polylog(n)) time. In the
terminology of Reif [35], such parallelism is "global" rather than "local." Determining the optimal
time lower bound for uniquely self-assembling an n x n square in the hierarchical aTAM was stated
as an open problem in [3]. 

1 For intuition, picture the fastest growing assembly: a single tile type able to bind to itself on all sides, filling the
plane starting from a single copy at the origin. After t "parallel steps", with high probability it has a circumference,
and hence frontier size, of O(t), while occupying area O(t^2).

We achieve three main results. Stated informally, we show a highly parallel (but, surprisingly,
slow) assembly of an n x n square in a hierarchical system. We prove that no "partial order
hierarchical system" (defined below) can break the Ω(n) lower bound for any shape of diameter
n. Finally, we show that a hierarchical system violating the "partial order" property is able to
assemble a rectangle of diameter n in time o(n). We now discuss these results in more detail.

We show that in the hierarchical aTAM, it is indeed possible to assemble an n x n square 
using nearly maximal "parallelism," so that the full n x n square is formed from four n/2 x n/2 
sub-squares, which are themselves each formed from four n/4 x n/4 sub-squares, etc. If one were 
to assume a constant time for any two producible assemblies to bind once each is produced, this 
would imply a polylogarithmic time complexity of assembling the final square. But accounting 
for the effect of assembly concentrations on binding rates, our construction takes superlinear time, 
since some sub-square has concentration at most O(1/n^2), so the time for even a single step of
hierarchical assembly is at least Ω(n^2) by standard models of chemical kinetics.

In [3] the authors define a class of deterministic seeded tile systems known as partial order 
systems, which intuitively are those systems that enforce a precedence relationship (in terms of 
time of attachment) between any neighboring tiles in the unique terminal assembly that bind with 
positive strength. We extend the definition of partial order systems in a natural way to hierarchical 
systems, and for this special case of systems, we answer the question of [3] negatively, showing that 
Ω(N) time is required to assemble any structure with diameter N. This implies in particular that
the Ω(n) lower bound in the seeded model for assembling an n x n square applies to partial order
systems in the hierarchical model. To obtain this result, it is necessary to introduce a definition 
of assembly time applicable to both seeded and hierarchical tile systems. We define such a model 
based on chemical kinetics. When applied to seeded systems, the model results in (nearly) the 
same definition used in [3], in the limit of low concentration of seed tiles. 2 

Thus, for the purpose of speeding up the process of self-assembly, the parallelism of the hi- 
erarchical assembly model is of no use whatsoever in partial order systems. We note, however, 
that there are other theoretical advantages to the hierarchical model, for instance, the use of steric 
hindrance to enable algorithmic fault-tolerance [15]. For this reason, our highly parallel square 
construction may be of independent interest despite the fact that the parallelism does not confer 
a speedup. Informally, if tile system 𝒯 uniquely self-assembles a shape S, define depth^da(𝒯) to be
the worst-case "number of parallel assembly steps" (depth of the tree that decomposes the final
assembly recursively into the subassemblies that combined to create it) required by the tile system
to reach its final assembly. (A formal definition is given in Section 3.) Clearly depth^da(𝒯) ≥ log |S|
if S is the shape assembled by 𝒯. Our construction is quadratically close to this bound in the
case of assembling an n x n square S_n, showing that depth^da(𝒯) ≤ O(log^2 n). Furthermore, this is
achievable using O(log n / log log n) tile types, which is asymptotically optimal. 3 That is, not only is it the
case that every producible assembly can assemble into the unique terminal assembly (by the
definition of unique assembly), but in fact every producible assembly is at most O(log^2 n) attachment
events from becoming the terminal assembly.

2 Low seed concentration is required to justify the assumption used in [3] of constant concentration of non-seed
tiles, so we are not "cheating" by using this assumption to argue that the models nearly coincide on seeded systems.
The one sense in which the models are different for seeded systems is that tile concentrations are allowed to deplete
in our model. As we argue in Section 4.2, this difference does not account for our time lower bound. Furthermore,
this difference makes our model strictly more realistic than the model of [3]. Tile systems in which this difference
would be noticeable are those in which large assemblies not containing the seed can form, which are assumed away in
the seeded model. Such systems are precisely those for which the assumptions of the seeded model are not justified.

3 Without any bound on tile complexity, the problem would be trivialized by using a unique tile type for each
position in the shape, each equipped with specially placed bonds as in Figure 1 to ensure a logarithmic-depth assembly
tree.

Demaine, Demaine, Fekete, Ishaque, Rafalin, Schweller, and Souvaine [12] studied a similar
complexity measure called stage complexity for another variant of the aTAM known as staged
assembly. In the staged assembly model, a hierarchical model of attachment is used, with the
added ability to prepare different assemblies in separate test tubes. The separate test tubes are 
allowed to reach a terminal state, after which individual tile types are assumed to be washed away, 
before combining the tubes. The stage complexity of a tile system is similarly defined to be the 
depth of the "mixing tree" describing the order of test tube mixing steps. Our model is more 
restrictive by permitting only one test tube ("bin complexity 1" in the language of [12]). In a 
sense, we are able to "automate" the highly selective mixing that is assumed to be externally 
controlled in the staged assembly model, while paying only a quadratic price in the number of 
parallel assembly stages required (and naturally paying a price in tile complexity as well, since our 
system, unlike the staged model, must encode the size n of the square entirely in the tile types). 
The primary challenge in achieving a highly parallel square construction in the hierarchical model 
is the prevention of overlapping subassemblies. Adleman [2] showed an Ω(n) lower bound (in a much
different and more permissive model of assembly time than in the present paper; later improved
to Ω(n log n) within the same model by Adleman, Cheng, Goel, Huang, and Wasserman [4]) for
the problem of assembling a 1 x n line from n distinct tile types t_1, ..., t_n. The main intuitive
reason that the time is not O(log n) is that if assemblies α_1 = t_i ... t_j and α_2 = t_i' ... t_j' form, with
i < i' < j < j', then α_1 can never attach to α_2 because they overlap. Staged assembly can be used
to control the overlap directly by permitting only the growth of lines covering dyadic intervals.

Finally, to show that our partial order hypothesis is necessary to obtain a linear-time lower 
bound, we construct a hierarchical tile system that, according to our model of assembly time, 
can assemble a rectangle in time sublinear in its diameter. More precisely, we show that for 
infinitely many n, there is a hierarchical tile system that assembles an n x n' rectangle, where 
n > n', in time O(n^(4/5) log n). The key idea is the use of both "assembly parallelism" and "binding
parallelism." By "assembly parallelism," we mean the form of parallelism discussed above, the 
ability of the hierarchical model to form multiple large assemblies independently in parallel. By 
"binding parallelism," we mean the (much more modest) parallelism already present in the seeded 
model, the ability of a single tile or assembly to have multiple potential binding sites to
which to attach on the "main" growing assembly. If there are k such binding sites, the first such
attachment will occur in expected time 1/k times that of the expected time for any fixed binding site
to receive an attachment, a fact exploited in our tile system to achieve a speedup.
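
The 1/k factor can be checked numerically. The following is a minimal sketch, not part of the paper's model: it assumes that each of the k binding sites independently receives its first attachment after an exponentially distributed waiting time with a common rate, as in standard chemical kinetics. The function name and its parameters are hypothetical.

```python
import random


def mean_first_attachment_time(k, rate=1.0, trials=20000, seed=0):
    """Estimate the expected time until the first of k independent binding
    sites receives an attachment, each site waiting an Exp(rate) time.
    (Illustrative assumption: sites are independent and identical.)"""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        # The first attachment happens at the minimum of the k waiting times.
        total += min(rng.expovariate(rate) for _ in range(k))
    return total / trials


if __name__ == "__main__":
    for k in (1, 2, 4, 8):
        estimate = mean_first_attachment_time(k)
        print(f"k={k}: estimated {estimate:.3f}, predicted {1 / k:.3f}")
```

Under these assumptions the minimum of k exponential waiting times is itself exponential with rate k times larger, which is where the 1/k factor comes from.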

2 Informal Description of the Abstract Tile Assembly Model 

This section gives a brief informal sketch of the seeded and hierarchical variants of the abstract 
Tile Assembly Model (aTAM). See Section A for a formal definition of the aTAM. 

A tile type is a unit square with four sides, each consisting of a glue label (often represented
as a finite string) and a nonnegative integer strength. We assume a finite set T of tile types, but
an infinite number of copies of each tile type, each copy referred to as a tile. An assembly (a.k.a.,
supertile) is a positioning of tiles on the integer lattice Z^2; i.e., a partial function α : Z^2 ⇢ T.
Write α ⊑ β to denote that α is a subassembly of β, which means that dom α ⊆ dom β and
α(p) = β(p) for all points p ∈ dom α. In this case, say that β is a superassembly of α. We abuse
notation and take a tile type t to be equivalent to the single-tile assembly containing only t (at the
origin if not otherwise specified). Two adjacent tiles in an assembly interact if the glue labels on
their abutting sides are equal and have positive strength. Each assembly induces a binding graph,
a grid graph whose vertices are tiles, with an edge between two tiles if they interact. The assembly
is τ-stable if every cut of its binding graph has strength at least τ, where the weight of an edge is
the strength of the glue it represents. That is, the assembly is stable if at least energy τ is required
to separate the assembly into two parts. The frontier ∂α ⊆ Z^2 \ dom α of α is the set of empty
locations adjacent to α at which a single tile could bind stably.
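
To make the definitions of interaction and frontier concrete, here is a small sketch. The data layout is an assumption made for illustration only: an assembly is a dict from lattice points to tile-type names, and each tile type maps a side to a (label, strength) pair. Two glues are taken to interact when both label and strength agree and the strength is positive. The helper names and the four example tile types are hypothetical.

```python
# Sides are indexed N, E, S, W; a glue is a (label, strength) pair.
DIRS = {"N": (0, 1), "E": (1, 0), "S": (0, -1), "W": (-1, 0)}
OPPOSITE = {"N": "S", "E": "W", "S": "N", "W": "E"}
NULL_GLUE = ("", 0)


def tile(**sides):
    """Build a tile type; unspecified sides carry the strength-0 null glue."""
    return {side: sides.get(side, NULL_GLUE) for side in DIRS}


def binding_strength(assembly, tile_types, pos, name):
    """Total strength with which tile type `name` would bind at empty `pos`."""
    x, y = pos
    total = 0
    for side, (dx, dy) in DIRS.items():
        neighbor = assembly.get((x + dx, y + dy))
        if neighbor is None:
            continue
        label, strength = tile_types[name][side]
        n_label, n_strength = tile_types[neighbor][OPPOSITE[side]]
        if label == n_label and strength == n_strength and strength > 0:
            total += strength
    return total


def frontier(assembly, tile_types, tau):
    """Empty locations adjacent to the assembly where some tile binds with strength >= tau."""
    empty_neighbors = {
        (x + dx, y + dy) for (x, y) in assembly for dx, dy in DIRS.values()
    } - set(assembly)
    return {
        pos
        for pos in empty_neighbors
        if any(binding_strength(assembly, tile_types, pos, name) >= tau
               for name in tile_types)
    }


# Hypothetical example: at temperature 2 the corner tile needs both neighbors.
EXAMPLE_TILES = {
    "seed": tile(N=("a", 2), E=("b", 2)),
    "up": tile(S=("a", 2), E=("c", 1)),
    "right": tile(W=("b", 2), N=("d", 1)),
    "corner": tile(W=("c", 1), S=("d", 1)),
}
```

In the example, the location (1, 1) joins the frontier only once both "up" and "right" are present, since the corner tile binds through two cooperating strength-1 glues.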

A seeded tile assembly system (seeded TAS) is a triple 𝒯 = (T, σ, τ), where T is a finite set of
tile types, σ : Z^2 ⇢ T is a finite, τ-stable seed assembly, and τ is the temperature. An assembly α
is producible if either α = σ or if β is a producible assembly and α can be obtained from β by the
stable binding of a single tile. In this case write β →_1 α (α is producible from β by the attachment
of one tile), and write β → α if β →_1^* α (α is producible from β by the attachment of zero or more
tiles). An assembly is terminal if no tile can be τ-stably attached to it.

A hierarchical tile assembly system (hierarchical TAS) is a pair 𝒯 = (T, τ), where T is a finite
set of tile types and τ ∈ N is the temperature. An assembly is producible if either it is a single tile
from T, or it is the τ-stable result of translating two producible assemblies without overlap. An
assembly α is terminal if for every producible assembly β, α and β cannot be τ-stably attached. The
restriction on overlap is a model of a chemical phenomenon known as steric hindrance [46, Section 
5.11] or, particularly when employed as a design tool for intentional prevention of unwanted binding 
in synthesized molecules, steric protection [24-26]. 

In either the seeded or hierarchical model, let A[𝒯] be the set of producible assemblies of 𝒯,
and let A□[𝒯] ⊆ A[𝒯] be the set of producible, terminal assemblies of 𝒯. A TAS 𝒯 is directed
(a.k.a., deterministic, confluent) if |A□[𝒯]| = 1. Given a connected shape X ⊆ Z^2, a TAS 𝒯 strictly
self-assembles X if every producible, terminal assembly places tiles exactly on those positions in X.

3 Nearly Maximally Parallel Hierarchical Assembly of an n x n 
Square with Optimal Tile Complexity 

In this section we show that under the hierarchical model of tile assembly, it is possible to
self-assemble an n x n square, for arbitrary n ∈ Z^+, using the asymptotically optimal O(log n / log log n) number
of tile types. Furthermore, the square assembles using nearly the maximum possible parallelism in
the hierarchical model, building the final square out of four assembled sub-squares of size n/2 x n/2,
which are themselves each assembled from four sub-squares of size n/4 x n/4, etc. The
sub-optimality stems from the need for us to construct the smallest sub-squares of size O(log n) x
O(log n) = O(log^2 n) without parallelism.

We formalize the notion of "parallelism through hierarchical assembly" as follows. 

Let 𝒯 = (T, τ) be a directed hierarchical TAS. Let α ∈ A[𝒯] be a producible assembly. An
assembly tree Υ of α is a binary tree whose nodes are labeled by producible assemblies, with α
labeling the root, individual tile types labeling the |α| leaves, and node v having children u_1 and u_2
with the requirement that u_1 and u_2 can attach to assemble v. That is, Υ represents one possible
pathway through which α could be produced from individual tile types in 𝒯. Let Υ(𝒯) denote the
set of all assembly trees of 𝒯. Say that an assembly tree is terminal if its root is a terminal assembly.
Let Υ□(𝒯) denote the set of all terminal assembly trees of 𝒯. Note that even a directed hierarchical
TAS can have multiple terminal assembly trees that all have the same root terminal assembly. The
assembly depth of 𝒯 is depth^da(𝒯) = max over Υ ∈ Υ□(𝒯) of depth(Υ), where depth(Υ) denotes the standard
depth of the tree Υ, the length of the longest path from any leaf to the root.
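
As a small illustration of tree depth, the sketch below represents an assembly tree as nested pairs whose leaves are tile names. This encoding is an assumption made here for brevity, and the code does not check that the two children of a node can actually attach. It compares a balanced and a one-tile-at-a-time tree for a four-tile assembly against the logarithmic lower bound.

```python
import math


def depth(tree):
    """Length of the longest leaf-to-root path; a leaf (single tile) has depth 0."""
    if not isinstance(tree, tuple):
        return 0
    left, right = tree
    return 1 + max(depth(left), depth(right))


def leaves(tree):
    """Number of leaves, i.e. the number of tiles in the root assembly."""
    if not isinstance(tree, tuple):
        return 1
    return leaves(tree[0]) + leaves(tree[1])


def depth_lower_bound(tree):
    """A binary tree with N leaves has depth at least ceil(log2 N)."""
    return math.ceil(math.log2(leaves(tree)))


# Hypothetical four-tile assembly built in two different orders.
balanced = (("t1", "t2"), ("t3", "t4"))      # pairs merge in parallel
sequential = ((("t1", "t2"), "t3"), "t4")    # one tile attaches at a time

if __name__ == "__main__":
    for name, tree in (("balanced", balanced), ("sequential", sequential)):
        print(name, depth(tree), ">=", depth_lower_bound(tree))
```

Since the assembly depth is a maximum over all terminal assembly trees, a system achieves small depth only if even its least balanced pathway is shallow.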

It is clear by the definition that for any shape S with N points strictly self-assembled by a
tile system 𝒯, depth^da(𝒯) ≥ log N. Our construction nearly achieves this bound in the case of
assembling an n x n square S_n, while simultaneously obtaining optimal tile complexity O(log n / log log n).
In other words, not only is it the case that every producible assembly can assemble into the terminal
assembly (by the definition of directed), but in fact every producible assembly is at most O(log^2 n)
attachment events from becoming the terminal assembly.

In [6], the authors prove that whenever n ∈ N is algorithmically random, at least Ω(log n / log log n)
tile types are required to strictly self-assemble an n x n square in the hierarchical model. Actually,
that paper states only that this holds for the q-tile model, in which some constant q exists that 
limits the size of attachable assemblies other than those containing a special seed tile, and the 
authors claim that the proof requires the bound q, but in fact their proof does not use the bound 
q and works for the general hierarchical model [42]. 

Theorem 3.1. For all n ∈ N, there is a hierarchical TAS 𝒯 = (T, 2) such that 𝒯 strictly
self-assembles an n x n square, |T| = O(log n / log log n), and depth^da(𝒯) = O(log^2 n).

Proof. A high-level outline of the construction is shown in Figure 1. We assemble a number of
blocks of size O(log n) x O(log n), each of which represents in its tile types an address indicating
its position in the square, and the block binds only to (some of) its neighboring blocks. The blocks
assemble using standard single-tile accretion (actually we cannot directly enforce this in the model,
but each block will nonetheless assemble the same structure in either model). Since each block
is O(log^2 n) total tiles, this is the source of the suboptimal O(log^2 n) assembly depth. Once the
blocks are assembled, however, they assemble into the full square using O(log n) assembly depth.
All blocks (x, y) with x even bind to (x + 1, y) to create the two-block assembly (x, y) : (x + 1, y), then
all blocks (x, y) : (x + 1, y) with y even bind to (x, y + 1) : (x + 1, y + 1) to create the four-block
assembly (x, y) : (x + 1, y) : (x, y + 1) : (x + 1, y + 1), etc.
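
The block-level merging order can be sketched as follows. This is only a counting illustration under the assumption that merges alternate strictly between horizontal and vertical pairings, starting with horizontal, as in the description above. The function name is hypothetical.

```python
def merge_stages(k):
    """Alternate horizontal and vertical pairwise merges of a 2**k x 2**k grid
    of blocks. Returns, after each stage, the number of assembled groups along
    each dimension as a (columns, rows) pair."""
    cols = rows = 2 ** k
    history = []
    while cols > 1 or rows > 1:
        if cols >= rows:
            cols //= 2      # horizontal stage: neighbors along x pair up
        else:
            rows //= 2      # vertical stage: neighbors along y pair up
        history.append((cols, rows))
    return history


if __name__ == "__main__":
    for stage, groups in enumerate(merge_stages(3), start=1):
        print(stage, groups)
```

For a 2^k x 2^k grid of blocks this gives 2k stages, which is logarithmic in the number of blocks and matches the O(log n) depth claimed for the block-merging phase.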

The construction will actually control the width of the square only to within an additive
logarithmic term by bringing together blocks of width and length O(log n); standard techniques can
be used to make the square precisely n x n. For instance, we could add O(log n) total filler tiles
to the leftmost and bottommost blocks, while adding only O(log^2 n) to the assembly depth and
O(log log n) to the tile complexity, since such filler tiles could be assembled from a counter that
counts to log n using O(log log n) tile types. For simplicity we describe the desired width n as the
number of blocks instead of the desired dimensions of the square and omit the details of this last
step of filling in the logarithmic gap.

Figure 1 outlines the construction of a square when the number of blocks n is a power of two. 
Figure 2 shows how to modify the blocks so that some of them are double in width, double in height, 
or both, to achieve a total square width that is an arbitrary positive integer. Each block contains 
the same O(log n / log log n) tile types that encode n, and as the block assembles it randomly chooses x- and
y-coordinates, which represent an index in the square. This random choice is implemented through
competition between tile types that share the same "input" glues but represent different bits of
x or y. These are used to determine the block's own size and to determine what series of bumps
and dents to place on its perimeter to enforce that the only blocks that can bind are adjacent in
Figure 1. The coordinates are also used to determine where to place strength-1 glues. The same
strength-1 glue is used uniformly throughout Figure 1. The bumps and dents ensure that no two
blocks can attach unless they are adjacent in the figure. 

The growth of an individual block is shown in Figure 3. We describe the assembly as if it grows
only by single-tile accretion. There are some strength-2 glues so this is not completely accurate,
but the growth of the block is "polyomino-safe", to borrow a term of Winfree [51]. By design, no
assembly larger than four tiles can form except by attachment to the growing block, and even if these
assemblies attach at once to the block rather than by single-tile accretion, the correct operation of
the block growth is unaffected. This is due to the fact that all strength-2 glues are "one-to-one";
no strength-2 glue is shared as an "output" (in the direction of growth in the seeded model) by two
different tile types. This implies that no partial assembly occurring away from the main "seeded"
assembly can grow "backward" and place an incorrect tile.

Figure 1: Overview of the hierarchical TAS that assembles an n x n square with O(log^2 n) assembly depth
and O(log n / log log n) tile complexity. Each square in the figure represents a block of width O(log n) with each side
of each block encoding its (x, y)-address in the square. (The encoding scheme is shown in more detail in
Figure 3.) Each of the thin solid lines is a strength-1 glue; in fact they are all identical. Dotted lines connect
those glues that are intended to bind to each other. The circled subassemblies show the order of growth of
one particular block (at coordinates (3,2)) into the final square.

[Figure 2 labels: for an n x n square, choose k so that 2^k ≤ n < 2^(k+1); in the example, n = 13 and k = 3;
2^k x 2^k = 8 x 8 total blocks are used; the first n - 2^k = 5 blocks on either dimension are doubled in
size along that dimension.]

Figure 2: Design of block sizes to handle values of n that are not a power of two. There are always exactly
2^k x 2^k blocks, where 2^k ≤ n < 2^(k+1). Each block doubles its length along the x-axis (resp. y-axis) if
n - 2^k exceeds its x-coordinate (resp. y-coordinate).

To form a square of size n x n "units" (where a unit is O(log n), the width and height of a small
block), we choose the largest power of two 2^k ≤ n and assemble exactly 2^k x 2^k different types
of blocks, doubling the width (resp. height) of the first n - 2^k of them in the x-direction (resp.
y-direction), as in Figure 2. The orange (medium darkness in gray-scale) tile types in Figure 3
are the only non-constant set of tile types. Borrowing a technique from [3], we will represent n in
base b, where b ≈ log n / log log n, using ≈ log n / log log n unique tile types, and we use a constant
number of tile types to convert n - 2^k to binary and accomplish all the other tasks needed to
assemble the block.
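
The block-size rule can be checked with a few lines of arithmetic. The sketch below assumes 0-indexed block coordinates and measures widths in "units"; the function name is hypothetical.

```python
def block_layout(n):
    """Return (k, widths), where 2**k is the largest power of two <= n and
    widths[i] is the size in units of the block with coordinate i along one
    dimension. The first n - 2**k blocks are doubled."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    k = n.bit_length() - 1           # largest k with 2**k <= n
    extra = n - 2 ** k               # number of doubled blocks, 0 <= extra < 2**k
    widths = [2 if i < extra else 1 for i in range(2 ** k)]
    return k, widths


if __name__ == "__main__":
    k, widths = block_layout(13)
    print(k, widths, sum(widths))
```

For n = 13 this yields k = 3 with five doubled blocks followed by three single blocks, for a total width of 13 units, in agreement with the example of Figure 2.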

Choose b = 2^m to be a power of two such that log n / log log n ≤ b < 2 log n / log log n. Each digit
in base b can represent m bits of n - 2^k. The value n - 2^k is encoded in exactly ⌈k/m⌉ = Θ(log n / log log n)
base-b digits, which together hold m · ⌈k/m⌉ ≥ k bits. The blue (dark in gray-scale) tile types in Figure 3
convert n - 2^k from base b to binary and at
the same time represent n - 2^k with its "bit-quadrupled" version (e.g., 0110 ↦ 0000111111110000),
since each bit along the edge will eventually require width four to make room to place the bumps
properly. 4 The counterclockwise order of growth ensures that if not all of the bumps and dents are
formed, then at least one of the four strength-1 glues necessary for an attachment event to occur
is not yet present in one of the blocks. To ensure that the TAS is directed, we do not include
base-conversion tiles for any digit d ∈ {0, 1, ..., b - 1} that does not appear in the base-b expansion
of n - 2^k, otherwise those tiles will form unused terminal assemblies. Each glue in a tile type
representing a bit is "marked" indicating whether it is the most significant bit, least significant bit,
or interior bit, as well as being marked with its relative position among the four copies of the bit.

4 The bumps cannot simply be placed with strength-2 glues above a width-1 or even width-2 representation of a
bit in the obvious way, otherwise there would be nothing to force that the bumps are present before the inter-block
strength-1 glues. If the bumps are allowed to grow in parallel with the rest of the assembly then they may not
complete fast enough. Width four is required to create a "linear assembly path" for the bumps and dents tiles to
follow, ensuring that growth of the block continues only once the path is complete.

Figure 3: Assembly of O(log n) x O(log n) size block from O(log n / log log n) tile types. Every block starts from the
same tile types that encode n - 2^k, using exactly k bits; in this example, n = 22 so k = 4 and n - 2^k is 0110
in binary using 4 bits. Thick solid lines represent strength-2 glues. For clarity, strength-1 glues are shown
selectively to help verify that a certain order of growth is possible to enforce. The tiles encode n - 2^k in base
b chosen to be a power of two such that log n / log log n ≤ b < 2 log n / log log n (labeled "seed" for intuition,
although those tiles start unattached). A constant-size set of tile types does the rest. n - 2^k is first converted to binary and
each of its bits quadrupled to make room for the bumps and dents. Then x and y coordinates are randomly
guessed and simultaneously compared to n - 2^k; if either is smaller, that dimension is doubled in length (in
this example the height is doubled but not the width). At the same time, the values of x and y are compared
as described in the proof of Theorem 3.1 to determine where to place strength-1 glues.
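
The encoding steps described in the proof (choice of base, base-b digits, bit quadrupling) can be traced on the example n = 22. This is an illustrative sketch only: logarithms are taken base 2, k is taken to be the largest integer with 2^k ≤ n, and all function names are hypothetical. It operates on numbers and strings and does not model the tile types themselves.

```python
import math


def choose_base(n):
    """Return (m, b) with b = 2**m the power of two satisfying r <= b < 2r,
    where r = log n / log log n (base-2 logarithms, n >= 4 assumed)."""
    if n < 4:
        raise ValueError("n must be at least 4")
    r = math.log2(n) / math.log2(math.log2(n))
    m = max(1, math.ceil(math.log2(r)))
    return m, 2 ** m


def to_base_digits(value, m, k):
    """Split the k-bit expansion of value into ceil(k / m) digits in base 2**m,
    most significant digit first, padding with leading zeros."""
    num_digits = -(-k // m)          # ceil(k / m)
    mask = 2 ** m - 1
    return [(value >> (m * i)) & mask for i in reversed(range(num_digits))]


def quadruple_bits(bits):
    """Repeat every bit of a binary string four times."""
    return "".join(ch * 4 for ch in bits)


if __name__ == "__main__":
    n = 22
    k = n.bit_length() - 1                      # k = 4
    m, b = choose_base(n)
    value = n - 2 ** k                          # 6, i.e. 0110 on k bits
    print(m, b, to_base_digits(value, m, k))
    print(quadruple_bits(format(value, f"0{k}b")))
```

The quadrupled string for 0110 is 0000111111110000, the same example given in the text.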

Once n - 2^k is converted to binary, we use nondeterministic attachment of tiles to the north of
this value to randomly guess 2k bits that represent the x- and y-coordinates of the block, meaning
the binary numbers represented on the top and right, respectively, of each block in Figure 1. To be
precise, we must actually choose each of x and y to be a random bit string that is not all 1's, since
each represents a connection between two blocks, of which there are 2^k - 1 along each dimension. It
is straightforward to encode into the tile types the logic that if the first k - 1 bits were 1, then the final
bit must be 0. A number of additional computations are done on these values (some computations
are possible to do as the values are guessed). The results of these computations will be stored 
in the rightmost tile type and propagated to all subsequent tile types. First, each of x and y is 
compared to n - 2^k to determine how large to make each dimension of the block. In the example
of Figure 3, y < n - 2^k and x ≥ n - 2^k, so the block is one "unit" wide and two "units" high. Also,
the binary expansions of x and y are themselves compared to determine where to place strength-1
glues. After x and y are determined, to place bumps and dents on the left and bottom of the block,
the values x - 1 and y - 1 must be computed, which requires assembling from least significant to
most significant, so this is delayed until after the first 90-degree rotation shown in Figure 3. Once 
these values are computed, they are also used to determine placement of glues. The entire block is 
created by rotating either counter-clockwise (in the case of x even, as shown in the bottom right of 
Figure 3) or clockwise (in the case of x odd, not shown but the exact mirror image of the bottom 
right of Figure 3), placing bumps and dents and single-strength glues. The glues are placed in the 
order shown by the rotations, so that the last glue to be (potentially) placed is the top east-facing 
glue in the case of x even, or the top west-facing glue in the case of x odd. 

By inspection of Figure 1, it is routine to verify that the following rules can be used to determine 
placement of strength-1 glues. If x is even, then place two single-strength glues on the right edge. 
If x is odd, then place two single-strength glues on the left edge. For a natural number n, define 
t(n) to be the number of trailing 1's in n's binary expansion. If $t(x) > t(y)$, then place exactly one 
strength-1 glue on the top edge. If $t(x) > t(y - 1)$, then place exactly one strength-1 glue on the 
bottom edge. If $t(x - 1) < t(y) + 1$ and x is even, then place exactly one strength-1 glue on the left 
edge. If $t(x) < t(y) + 1$ and x is odd, then place exactly one strength-1 glue on the right edge. 
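
The function t used in these rules is easy to compute directly. The following is a minimal Python sketch, not part of the construction itself (which evaluates these quantities with finite automata embedded in tile types); the names `trailing_ones` and `double_glue_edge` are illustrative assumptions.

```python
def trailing_ones(n: int) -> int:
    """Number of trailing 1 bits in the binary expansion of n (t(n) in the text)."""
    if n < 0:
        raise ValueError("n must be a natural number")
    count = 0
    while n & 1:          # lowest bit is 1
        count += 1
        n >>= 1           # drop the lowest bit
    return count


def double_glue_edge(x: int) -> str:
    """Edge that receives the two single-strength glues: right if x is even, left if x is odd."""
    return "right" if x % 2 == 0 else "left"
```

For example, 11 is 1011 in binary, so `trailing_ones(11)` counts the two low-order 1's.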

Each of these computations (for placement of glues and for determining block dimensions) can 
be computed by a deterministic finite automaton whose input symbols represent tuples of bits from 
n — 2 k , x, x — 1, y — 1, and y. These automata can then be combined in a product construction and 
embedded into the tile types that accrete in the row above n — 2 k if only x and y are needed, and 
embedded into tile types that are placed after the first rotation if x — 1 or y — 1 is needed. Since the 
decision for placing glue on the top edge requires only x and y, this ensures that the decision for 
each glue placement can be made before the region containing the potential glue site is assembled. 

As shown in Figure 3, some padding with filler tiles is necessary to make the block a perfect 
rectangle. Also, some padding is needed in the case of a doubling of height or width, to ensure that 
the resulting assembly has height or width precisely twice that of the non-doubled version. □ 

4 Time Complexity in the Hierarchical Model 

In this section we define a formal notion of time complexity for hierarchical tile assembly systems. 
The model we use applies to both the seeded aTAM and the hierarchical aTAM. 

For hierarchical systems, our assembly time model may not be completely suitable since we 
make some potentially unrealistic assumptions. In particular, we ignore diffusion rates of molecules 
based on size and assume that large assemblies diffuse as fast as individual tiles. We also assume 
that the binding energy $\tau$ necessary for a small tile t to attach stably to an assembly $\alpha$ is the 
same as the binding energy required for a large assembly $\beta$ to attach stably to $\alpha$, even though one 
would expect such large assemblies to have a higher reverse rate of detachment (slowing the net 
rate of forward growth) if bound with only strength $\tau$. However, from the perspective of our lower 
bound on assembly time, Theorem 5.2, these assumptions have the effect of making hierarchical 
self-assembly appear faster. We show that even with these extra assumptions, the time complexity 
of hierarchical partial order systems is still no better than the seeded aTAM. However, caution is 
warranted in interpreting the upper bound result, Theorem 6.1, of a sublinear time assembly of 
a shape. As we discuss in Section 7, a plausible treatment of diffusion rates, together with our 
lower bound techniques based on low concentrations of large assemblies, may yield an absolute 
linear-time (in terms of diameter) lower bound on assembly time of hierarchical systems, so that 
Theorem 6.1 may owe its truth entirely to the heavily exploited assumption of equally fast diffusion 
of all assemblies. A reasonable interpretation of Theorem 6.1 is that the partial order assumption is 
necessary to prove Theorem 5.2 and that concentration arguments alone do not suffice to establish 
linear-time lower bounds in general hierarchical systems. The techniques that weave together 
both "assembly parallelism" and "binding parallelism", as discussed in Section 1 and Section 6, 
may prove useful in other contexts, even though their attained speedup is modest. 

4.1 Definition of Time Complexity of Seeded Tile Systems 

We now review the definition of time complexity of seeded self-assembly proposed in [3]. A 
concentrations function on a tile set $T$ is a subprobability measure $C : T \to [0,1]$ (i.e., $\sum_{r \in T} C(r) \le 1$). 
Each tile type $r$ is assumed to be held at a fixed concentration $C(r)$ throughout the process of 
assembly.⁵ The assembly time for a seeded TAS $\mathcal{T} = (T, \sigma, \tau)$ is defined by picking a copy of 
the seed arbitrarily and measuring the expected time before the seed grows into some terminal 
assembly, when assembly proceeds according to the following stochastic model. The assembly 
process is described as a continuous-time Markov process in which each state represents a producible 
assembly, and the initial state is the seed assembly $\sigma$. For each pair of producible assemblies $\alpha, \beta$ 
such that $\alpha \to_1 \beta$ via the addition of tile type $r$, there is a transition in the Markov process from 
state $\alpha$ to state $\beta$ with transition rate $C(r)$.⁶ The sink states of the Markov process are precisely 
the terminal assemblies. The time to reach some terminal assembly from $\sigma$ is a random variable 
$\mathbf{T}_{\mathcal{T},C}$, and the assembly time complexity of the seeded TAS $\mathcal{T}$ with concentrations $C$ is defined to 
be $\mathsf{T}(\mathcal{T}, C) = \mathrm{E}[\mathbf{T}_{\mathcal{T},C}]$. 
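
To make the definition concrete, here is a minimal Python sketch for a deliberately trivial, hypothetical system: a one-dimensional chain in which exactly one frontier location exists at every step, so the Markov process is a path and the expected assembly time is the sum of the reciprocals of the concentrations. The function names and the chain itself are illustrative assumptions, not a system from the paper.

```python
import random


def expected_line_time(concentrations):
    """Exact expected assembly time of the chain: sum over steps of 1 / C(r_i)."""
    if sum(concentrations) > 1.0 + 1e-12:
        raise ValueError("concentrations must form a subprobability measure")
    return sum(1.0 / c for c in concentrations)


def simulate_line_time(concentrations, rng):
    """One sample path: each attachment waits an exponential time with rate C(r_i)."""
    return sum(rng.expovariate(c) for c in concentrations)


def estimate_line_time(concentrations, samples=20000, seed=0):
    """Monte Carlo estimate of the expected assembly time."""
    rng = random.Random(seed)
    total = sum(simulate_line_time(concentrations, rng) for _ in range(samples))
    return total / samples
```

With concentrations 1/2, 1/4, 1/4 the exact value is 2 + 4 + 4 = 10, and the Monte Carlo estimate should land close to it.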

The requirement that the tile concentrations function C be a subprobability measure, rather 
than an arbitrary measure taking values possibly greater than 1, reflects a physical principle known 
as the finite density constraint, which stipulates that a given unit volume of solution may contain 
only a bounded number of molecules (if for no other reason than to avoid forming a black hole). 
By normalizing so that one "unit" of volume is the volume required to fit one tile, the total 
concentration of tiles (concentration defined as number or mass per unit volume) cannot exceed 1. 

5 For singly-seeded tile systems in which the seed tile $s \in T$ appears only once at the origin, this assumption is valid 
in the limit of low seed concentration $C(s)$ compared to all other concentrations $C(r)$ for $r \in T \setminus \{s\}$. This is because 
the number of terminal assemblies (if each is of size at most $K$) will be limited by $C(s)$, implying the percentage 
change in every other tile type $r$'s concentration is at most $K \cdot C(s)/C(r)$; therefore "low" seed concentration means 
setting $C(s) \ll C(r)/K$ for all $r \in T \setminus \{s\}$. In fact, to obtain an assembly time asymptotically as fast, one need only 
ensure that for all $r$, $C(r) \ge 2 \#_{\hat\alpha}(r) C(s)$, where $\#_{\hat\alpha}(r)$ is the number of times $r$ appears in the terminal assembly $\hat\alpha$. 
This guarantees that the concentration of $r$ is always at least half of its start value, which means that the assembly 
time, each step of which has rate proportional to the concentration of the tile type attaching at that step, is at most doubled 
compared to the case when the concentrations are held constant. 

6 That is, the time until the next attachment of a tile to $\alpha$ is an exponential random variable with rate 
$\sum_{p \in \partial\alpha} C(\hat\alpha(p))$, where $\partial\alpha$ is the frontier of $\alpha$, the set of empty locations at which a tile could stably attach to $\alpha$. The 
probability that a particular $p' \in \partial\alpha$ leading to assembly $\beta$ is the next location of attachment is $C(\hat\alpha(p')) / \sum_{p \in \partial\alpha} C(\hat\alpha(p))$. 

7 When our goal is to obtain only an asymptotic result concerning a family of tile systems assembling a family of 
assemblies of size/diameter $N$, we may relax the finite density constraint to the requirement that the concentrations 
sum to a constant $c \in \mathbb{R}^+$ independent of $N$, since these concentrations could be divided by $c$ to sum to 1 while 
affecting the assembly time results by the same constant $c$, leaving the asymptotic results unaffected. 

We have the following time complexity lower bound for seeded systems. This theorem says 
that even for non-directed systems, a seeded TAS can grow its diameter only linearly with time. 
It strengthens and implies Lemma 4.6 of the full version of [3], which applied only to directed 
systems. 

Let $d \in \mathbb{Z}^+$. Let $\mathcal{T} = (T, \sigma, \tau)$ be a singly-seeded TAS (meaning $|\sigma| = 1$), and let $C : T \to [0,1]$ 
be a concentrations function. Since it takes only constant time for the assembly to grow to any 
constant radius, restricting attention to singly-seeded systems does not asymptotically affect the 
result for tile systems with a finite seed assembly of size larger than 1. Let $\mathbf{D}(\mathcal{T}, C, d)$ be the 
random variable representing the time that any tile is first placed at distance $d$ (in the $L_1$ norm) 
from the seed, conditioned on the event that a tile is eventually placed at least distance $d$ from the 
seed, with $\mathbf{D}(\mathcal{T}, C, d) = \infty$ if all producible, terminal assemblies of $\mathcal{T}$ are completely contained in 
radius $d - 1$ around the seed. 

Theorem 4.1. For each $d \in \mathbb{Z}^+$, each singly-seeded TAS $\mathcal{T}$, and each concentrations function 
$C : T \to [0,1]$, $\mathrm{E}[\mathbf{D}(\mathcal{T}, C, d)] = \Omega(d)$. 

Proof. Since we care only about the first time at which a tile is attached at distance $d$ (before which 
there are no tiles at distance $d'$ for any $d' > d$), we can restrict the assembly process to the region 
of radius $d$ around the seed. Therefore we model the assembly process as if it proceeds normally 
until the first tile attaches at distance $d$ from the seed, at which point all growth immediately halts. 

Define $\mathbb{R}^+ = [0, \infty)$. Given $i \in \{0, \ldots, d\}$ and $t \in \mathbb{R}^+$, let $X_i(t)$ be a random variable denoting 
the number of tiles attached at locations with distance exactly $i$ from the seed at time $t$, under 
the restriction stated above that all assembly halts the moment that a tile is placed at distance $d$. 
Then for all $t \in \mathbb{R}^+$, the event $X_d(t) = 0$ (no tile is at distance $d$ by time $t$) is equivalent to the 
event $\mathbf{D}(\mathcal{T}, C, d) > t$ (the time of the first attachment at distance $d$ strictly exceeds $t$). 

In a seeded TAS, tiles can attach at a location only when there is another tile adjacent to the 
location. Locations at $L_1$-distance $i$ to the seed are only adjacent to locations at distance either 
$i + 1$ or $i - 1$ to the seed. Off the x- and y-axes, each location at distance $i$ has two neighbors at 
distance $i - 1$ and two neighbors at distance $i + 1$, and for the 4 locations at distance $i$ on either axis, 
every location has one neighbor at distance $i - 1$ and three neighbors at distance $i + 1$. Therefore, 
at time $t$, tiles are attachable to at most $3X_{i-1}(t) + 2X_{i+1}(t)$ different locations with distance $i$ to 
the seed. Since the total concentration of any single tile type is at most 1, the rate at which tiles 
attach at any given location is at most 1. For all $i \in \{0, \ldots, d\}$ and $t \in \mathbb{R}^+$, define $f_i(t) = \mathrm{E}[X_i(t)]$. 
Then for all $i \in \{1, \ldots, d - 1\}$ and $t \in \mathbb{R}^+$, 

$$\frac{df_i(t)}{dt} \le 3 f_{i-1}(t) + 2 f_{i+1}(t), \qquad \text{and} \qquad \frac{df_d(t)}{dt} \le 3 f_{d-1}(t).$$ 

The lack of a $2f_{d+1}(t)$ term in the latter inequality is due to our modification of the assembly 
process to immediately halt once the first tile attaches at distance $d$. Since the assembly process 
always starts with a single seed tile, $f_0(t) = 1$ for all $t \in \mathbb{R}^+$, and $f_i(0) = 0$ for all $i \in \{1, \ldots, d\}$. 
For all $t \in \mathbb{R}^+$ and all $i \in \{1, \ldots, d\}$, $f_i(t) \le 4i$ since there are exactly $4i$ locations at distance 
exactly $i$ to the seed. 

Let $t_0 \in \mathbb{R}^+$ be the unique time at which $f_d(t_0) = \frac{1}{2}$. This time is unique since $f_d$ is monotonically 
increasing. Since $f_d(t_0) = \mathrm{E}[X_d(t_0)]$, by Markov's inequality, $\Pr[X_d(t_0) \ge 1] \le \frac{1}{2}$, implying that 
$\Pr[X_d(t_0) < 1] \ge \frac{1}{2}$. Since $X_d$ is integer-valued and nonnegative, this is equivalent to stating that 
$\Pr[X_d(t_0) = 0] \ge \frac{1}{2}$. Recall that $X_d(t_0) = 0 \iff \mathbf{D}(\mathcal{T}, C, d) > t_0$, whence $\Pr[\mathbf{D}(\mathcal{T}, C, d) > t_0] \ge \frac{1}{2}$. 
By Markov's inequality, $\mathrm{E}[\mathbf{D}(\mathcal{T}, C, d)] \ge \frac{t_0}{2}$. Thus it suffices to prove that $t_0 \ge \Omega(d)$. To do this, 
we define a simpler function that is an upper bound for $f_d$ and solve its differential equations. 

For all $i \in \{0, \ldots, d\}$, define the function $g_i : \mathbb{R}^+ \to \mathbb{R}^+$ (which will serve as an upper bound for 
$f_i$) as follows. For all $i \in \{1, \ldots, d - 1\}$, 

$$\frac{dg_i(t)}{dt} = 3 g_{i-1}(t) + 2 g_{i+1}(t), \quad \text{when } g_i(t) < 4d,$$ 

$$\frac{dg_d(t)}{dt} = 3 g_{d-1}(t), \quad \text{when } g_d(t) < 4d,$$ 

and for all $i \in \{0, \ldots, d\}$, 

$$\frac{dg_i(t)}{dt} = 0, \quad \text{when } g_i(t) = 4d,$$ 

with the boundary conditions $g_0(t) = 4d$ for all $t \in \mathbb{R}^+$, $g_i(0) = 0$ for all $i \in \{1, \ldots, d\}$. It is routine 
to check that $g_i(t) \ge f_i(t)$ for all $i \in \{0, \ldots, d\}$ and $t \in \mathbb{R}^+$. Furthermore, if $g_i(t_0) \ge g_{i+1}(t_0)$ for 
all $i \in \{0, \ldots, d - 1\}$ and a specific time point $t_0 \in \mathbb{R}^+$, then 

$$\frac{dg_i(t)}{dt} \ge \frac{dg_{i+1}(t)}{dt} \quad \text{at time } t_0.$$ 

Since $g_i(0) \ge g_{i+1}(0)$ for all $i \in \{0, \ldots, d - 1\}$ by definition, we know that $g_i(t) \ge g_{i+1}(t)$ for all 
$i \in \{0, \ldots, d - 1\}$ and all $t \in \mathbb{R}^+$. Using this monotonicity, we can define a set of functions $h_i(t)$ that 
are upper bounds for $g_i(t)$ by the following. 

$$\frac{dh_i(t)}{dt} = 5 h_{i-1}(t) \text{ for all } i \in \{1, \ldots, d\}, \quad \text{and} \quad \frac{dh_0(t)}{dt} = 0,$$ 

with boundary conditions $h_0(t) = 4d$ for all $t \in \mathbb{R}^+$, $h_i(0) = 0$ for all $i \in \{1, \ldots, d\}$. Solving 
these differential equations, we obtain $h_d(t) = \frac{4d}{d!}(5t)^d$. Letting $t' = \frac{d}{10e}$, by Stirling's inequality 
$d! > \sqrt{2\pi d}\,\left(\frac{d}{e}\right)^d e^{1/(12d+1)} > \left(\frac{d}{e}\right)^d$, we have 

$$f_d(t') \le g_d(t') \le h_d(t') = \frac{4d}{d!} \cdot (5t')^d = \frac{4d}{d!} \cdot \left(\frac{d}{2e}\right)^d < \frac{4d}{(d/e)^d} \cdot \left(\frac{d}{2e}\right)^d = \frac{4d}{2^d}.$$ 

Since $f_d$ is monotonically increasing, $f_d(t_0) = \frac{1}{2}$ by definition, and $\frac{4d}{2^d} < \frac{1}{2}$ for sufficiently large $d$, 
this implies that $t_0 > t' = \frac{d}{10e}$. □ 
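
The final step of the proof can be checked numerically. The sketch below evaluates the closed form $h_d(t) = \frac{4d}{d!}(5t)^d$ at an evaluation point that grows linearly in $d$. The specific point $t' = d/(10e)$ is an assumption of this sketch (any point of the form $d/c$ for which the Stirling step yields a bound below 1/2 works equally well), and the function names are illustrative.

```python
import math


def h_d(d: int, t: float) -> float:
    """Closed-form solution h_d(t) = 4d * (5t)^d / d!, evaluated in log space."""
    if t <= 0:
        return 0.0
    log_value = math.log(4 * d) + d * math.log(5 * t) - math.lgamma(d + 1)
    return math.exp(log_value)


def t_prime(d: int) -> float:
    """Assumed evaluation point t' = d / (10e), which is linear in d."""
    return d / (10 * math.e)


def bound_holds(d: int) -> bool:
    """True if h_d(t') < 1/2, which forces t_0 > t' because f_d <= h_d and f_d(t_0) = 1/2."""
    return h_d(d, t_prime(d)) < 0.5
```

Under this assumption, `bound_holds(d)` is true for every moderately large $d$, which is all the asymptotic statement needs.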

4.2 Definition of Time Complexity of Hierarchical Tile Systems 

To define time complexity for hierarchical systems, we employ more explicitly the chemical kinetics 
that implicitly underlie the time complexity model for seeded systems stated in Section 4.1. We 
treat each assembly as a single molecule. If two assemblies $\alpha$ and $\beta$ can attach to create an assembly 
$\gamma$, then we model this as a chemical reaction $\alpha + \beta \to \gamma$, in which the rate constant is assumed to 
be equal for all reactions (and normalized to 1). In particular, if $\alpha$ and $\beta$ can be attached in two 
different ways, this is modeled as two different reactions, even if both result in the same assembly.⁸ 



8 The fact that some directed systems may not require at least one of these attachments to happen in every terminal 
assembly tree is the reason we impose the partial order requirement when proving our time complexity lower bound. 
At an intuitive level, the model we define can be explained as follows. We imagine dumping 
all tiles into solution at once, and at the same time, we grab one particular tile and dip it into 
the solution as well, pulling it out of the solution when it has assembled into a terminal assembly. 
Under the seeded model, the tile we grab will be a seed, assumed to be the only copy in solution 
(thus requiring that it appear only once in any terminal assembly). In the seeded model, no 
reactions occur other than the attachment of individual tiles to the assembly we are holding. In the 
hierarchical model, other reactions are allowed to occur in the background (we model this using the 
standard mass-action model of chemical kinetics [16]), but only those reactions with the assembly 
we are holding move us "closer" to completion. The other background reactions merely change 
concentrations of other assemblies (although these indirectly affect the time it will take our chosen 
assembly to complete, by changing the rate of reactions with our chosen assembly). 

One reason for choosing this model is that we would like to analyze the assembly time in 
such a way as to facilitate direct comparison with the results of [3]. In particular, we would 
like the assembly time model proposed in [3] to be derived as a special case of the model we 
propose, when only single-tile reactions with the seed-containing assembly are allowed. With a 
model such as Gillespie's algorithm [21-23] using finite molecular counts, it is possible that no 
copy of the terminal assembly forms, so it is not clear how to sensibly ask how long it takes to 
form. 9 The mass-action model of kinetics [16] describes concentrations as a dynamical system that 
evolves continuously over time according to ordinary differential equations derived from reaction 
rates. This is an accepted model of kinetics when molecular counts are very large, which is already 
an implicit assumption in the standard aTAM. In the mass-action model, all possible terminal 
assemblies (assuming there are a finite number of different terminal assemblies) are guaranteed to 
form, which resolves one issue with the purely stochastic model. But the solution goes too far: 
some (infinitesimal) concentration of all terminal assemblies form in the first infinitesimal amount 
of time. A sensible way to handle this may be to measure the time to half-completion (the time required 
for the concentration of a terminal assembly to exceed half of its steady-state concentration). But 
this model is potentially subject to "cheats" such as systems that "kill" all but the fastest growing 
assemblies, so as to artificially inflate the average time to completion of those that successfully 
assemble into the terminal assembly. Furthermore, it would not necessarily be fair to directly 
compare such a deterministic model with the stochastic model of [3]. 

The model of assembly time that we define is a continuous-time stochastic model similar to that 
of [3]. However, rather than fixing transition rates at each time $t \in \mathbb{R}^+$ as constant, we use 
mass-action kinetics to describe the evolution over time of the concentration of producible assemblies, 
including individual tile types, which in turn determine transition rates. To measure the time to 
complete a terminal assembly, we use the same stochastic model as [3] , which fixes attention on one 
particular tile and asks what is the expected time for it to grow into a terminal assembly, where 
the rate of attachment events that grow it are time-dependent, governed by the continuous mass- 
action evolution of concentration of assemblies that could attach to it. Unlike the seeded model, 
we allow the tile concentrations to deplete, since it is no longer realistic (or desirable for nontrivial 
hierarchical constructions) to assume that individual tiles do not react until they encounter an 
assembly containing the seed. 10 

9 This problem is easily averted in a seeded system by setting the seed concentration sufficiently low (but still 
fi(|T|)) to ensure that the terminal assembly is guaranteed to form at least one copy. In a hierarchical system it is 
not clear how to avoid this problem. 

10 However, this depletion of individual tiles is not the source of our time lower bound. Suppose that we used a 
transition rate of 1 for each attachment of an individual tile (which is an upper bound on the attachment rate even 
We first formally define the dynamic evolution of concentrations by mass-action kinetics. Let 
$\mathcal{T} = (T, \tau)$ be a hierarchical TAS, and let $C : T \to [0,1]$ be a concentrations function. Let 
$\mathbb{R}^+ = [0, \infty)$, and let $t \in \mathbb{R}^+$. For $\alpha \in \mathcal{A}[\mathcal{T}]$, let $[\alpha]_C(t)$ (abbreviated $[\alpha](t)$ when $C$ is clear from 
context) denote the concentration of $\alpha$ at time $t$ with respect to initial concentrations $C$, defined as 
follows.¹¹ We often omit explicit mention of $C$ and use the notation $[r](0)$ to mean $C(r)$, for $r \in T$, 
to emphasize that the concentration of $r$ is not constant with respect to time. Given two assemblies 
$\alpha$ and $\beta$ that can attach to form $\gamma$, we model this event as a chemical reaction $R : \alpha + \beta \to \gamma$. Say 
that a reaction $\alpha + \beta \to \gamma$ is symmetric if $\alpha = \beta$. Define the propensity (a.k.a., reaction rate) of 
$R$ at time $t \in \mathbb{R}^+$ to be $\rho_R(t) = [\alpha](t) \cdot [\beta](t)$ if $R$ is not symmetric, and $\rho_R(t) = \frac{1}{2} \cdot [\alpha](t)^2$ if $R$ is 
symmetric.¹² 

If $\alpha$ is consumed in reactions $\alpha + \beta_1 \to \gamma_1, \ldots, \alpha + \beta_n \to \gamma_n$ and produced in asymmetric 
reactions $\beta'_1 + \gamma'_1 \to \alpha, \ldots, \beta'_m + \gamma'_m \to \alpha$ and symmetric reactions $\beta''_1 + \beta''_1 \to \alpha, \ldots, \beta''_p + \beta''_p \to \alpha$, 
then the concentration $[\alpha](t)$ of $\alpha$ at time $t$ is described by the differential equation 

$$\frac{d[\alpha](t)}{dt} = \sum_{i=1}^{m} [\beta'_i](t) \cdot [\gamma'_i](t) + \sum_{i=1}^{p} \frac{1}{2} \cdot [\beta''_i](t)^2 - \sum_{i=1}^{n} [\alpha](t) \cdot [\beta_i](t), \qquad (4.1)$$ 

with boundary conditions $[\alpha](0) = C(r)$ if $\alpha$ is an assembly consisting of a single tile $r$, and 
$[\alpha](0) = 0$ otherwise. In other words, the propensities of the various reactions involving $\alpha$ determine 
its rate of change, negatively if $\alpha$ is consumed, and positively if $\alpha$ is produced. 

The definitions of the propensities of reactions deserve an explanation. Each propensity is 
proportional to the average number of collisions between copies of reactants per unit volume per 
unit time. For a symmetric reaction $\beta'' + \beta'' \to \alpha$, this collision rate is half the collision 
rate in the case where the second reactant is a distinct type of assembly, assuming it 
has the same concentration as the first reactant.¹³ Therefore the amount of $\alpha$ produced per unit 
volume per unit time is half that of a corresponding asymmetric reaction. The reason that terms of 
symmetric reactions $\alpha + \alpha \to \gamma$ that consume $\alpha$ are not corrected by factor $\frac{1}{2}$ is that, although the 
number of such reactions per unit volume per unit time is half that of a corresponding asymmetric 
reaction, each such reaction consumes two copies of $\alpha$ instead of one. This constant 2 cancels out 
the factor $\frac{1}{2}$ that would be added to correct for the symmetry of the reaction. Therefore, the term 
$[\alpha](t) \cdot [\beta_i](t)$ representing the rate of consumption of $\alpha$ is the proper value whether or not $\alpha = \beta_i$. 
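
The interplay of the two factors can be seen on the smallest possible example. The following Python sketch integrates equation (4.1) for a hypothetical system with a single symmetric reaction $a + a \to aa$, using a plain forward-Euler step; the function name, the step size, and the example system are assumptions made for illustration only.

```python
def simulate_dimerization(a0: float, t_end: float, dt: float = 1e-4):
    """Forward-Euler integration of mass-action kinetics for a + a -> aa.

    d[a]/dt  = -[a]^2        (half as many reactions, but each consumes two copies)
    d[aa]/dt = (1/2) [a]^2   (propensity of the symmetric reaction)
    """
    a, aa = a0, 0.0
    steps = int(round(t_end / dt))
    for _ in range(steps):
        propensity = 0.5 * a * a      # symmetric reaction: factor 1/2
        a -= 2.0 * propensity * dt    # two copies of a consumed per reaction
        aa += propensity * dt         # one copy of aa produced per reaction
    return a, aa
```

The monomer concentration follows the closed form $a_0 / (1 + a_0 t)$, and the total tile mass $[a] + 2[aa]$ stays equal to $a_0$ throughout, as conservation of mass requires.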



for seeded systems due to the finite density constraint) and dynamic transition rates only for attachment of larger 
assemblies. Then the assembly of hierarchical partial order systems still would proceed asymptotically no faster than 
if single tile attachments were the only reactions allowed (as in the seeded assembly case), despite the fact that all the 
single-tile reactions at the intersection of the seeded and hierarchical model would be at least as fast in the modified 
hierarchical model as in the seeded model. 

11 More precisely, $[\alpha](t)$ denotes the concentration of the equivalence class of assemblies that are equivalent to $\alpha$ 
up to translation. We have defined assemblies to have a fixed position only for mathematical convenience in some 
contexts, but for defining concentration, it makes no sense to allow the concentration of an assembly to be different 
from one of its translations. 

12 That is, all reaction rate constants are equal to 1. To the extent that a rate constant models the "reactivity" of 
two molecules (the probability that a collision between them results in a reaction), it seems reasonable to model the 
rate constants as being equal. To the extent that a rate constant also models diffusion rates, this assumption may 
not apply; we discuss the issue in Section 7. Since we are concerned mainly with asymptotic results, if rate constants 
are assumed equal, it is no harm to normalize them to be 1. 

13 For intuition, consider finite counts: with $n$ copies of $\gamma$ and $n$ copies of $\beta \ne \gamma$, there are $n^2$ distinct pairs of 
molecules of respective type $\gamma$ and $\beta$, but with only $n$ copies of $\gamma$, there are $\frac{n(n-1)}{2}$ distinct pairs of molecules of type 
$\gamma$, which approaches $\frac{1}{2} n^2$ as $n \to \infty$. 
This completes the definition of the dynamic evolution of concentrations of producible 
assemblies; it remains to define the time complexity of assembling a terminal assembly. Although we 
have distinguished between seeded and hierarchical systems, for the purpose of defining a model of 
time complexity in hierarchical systems and comparing them to the seeded system time complexity 
model of [3], it is convenient to introduce a seed-like "timekeeper tile" into the hierarchical system, 
in order to stochastically analyze the growth of this tile when it reacts in a solution that is itself 
evolving according to the continuous model described above. The seed does not have the purpose 
of nucleating growth, but is introduced merely to focus attention on a single molecule that has not 
yet assembled anything, in order to ask how long it will take to assemble into a terminal assembly.¹⁴ 
The choice of which tile type to pick will be a parameter of the definition, so that a system may 
have different assembly times depending on the choice of timekeeper tile. 

Fix a copy of a tile type $s$ to designate as a "timekeeper seed". The assembly of $s$ into some 
terminal assembly $\hat\alpha$ is described as a time-dependent continuous-time Markov process in which 
each state represents a producible assembly containing $s$, and the initial state is the size-1 assembly 
with only $s$. For each state $\alpha$ representing a producible assembly with $s$ at the origin, and for each 
pair of producible assemblies $\beta, \gamma$ such that $\alpha + \beta \to \gamma$ (with the translation assumed to happen 
only to $\beta$ so that $\alpha$ stays "fixed" in position), there is a transition in the Markov process from 
state $\alpha$ to state $\gamma$ with transition rate $[\beta](t)$.¹⁵ Unlike the seeded model, the transition rates vary 
over time since the assemblies (including assemblies that are individual tiles) with which $s$ could 
interact are themselves being produced and consumed. 

We define $\mathbf{T}_{\mathcal{T},C,s}$ to be the random variable representing the time taken for the copy of $s$ to 
assemble into a terminal assembly via some sequence of reactions as defined above. We define the 
time complexity of a directed hierarchical TAS $\mathcal{T}$ with concentrations $C$ and timekeeper $s$ to be 
$\mathsf{T}(\mathcal{T}, C, s) = \mathrm{E}[\mathbf{T}_{\mathcal{T},C,s}]$. It is worth noting that this expected value could be infinite. This would 
happen if some partial assembly $\alpha$, in order to complete into a terminal assembly, requires the 
attachment of some assembly $\beta$ whose concentration is depleting quickly. 
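
A small numerical experiment illustrates how the expectation can diverge. For a single required attachment with time-dependent rate $\lambda(t)$, the waiting time $T$ satisfies $\Pr[T > t] = \exp(-\int_0^t \lambda(u)\,du)$ and $\mathrm{E}[T] = \int_0^\infty \Pr[T > t]\,dt$. The Python sketch below truncates that integral at a finite horizon. The depletion profile $[\beta](t) = c/(1 + ct)$ is a hypothetical choice (it is what a species consumed by a reaction of the form $d[\beta]/dt = -[\beta]^2$ would follow), and the function names are illustrative.

```python
import math


def truncated_expected_time(rate, horizon: float, dt: float = 1e-2) -> float:
    """Midpoint-rule estimate of the integral of Pr[T > t] over [0, horizon]."""
    cumulative = 0.0   # running integral of the rate up to the start of the step
    total = 0.0        # running integral of the survival probability
    t = 0.0
    for _ in range(int(round(horizon / dt))):
        mid_rate = rate(t + dt / 2)
        survival_mid = math.exp(-(cumulative + mid_rate * dt / 2))
        total += survival_mid * dt
        cumulative += mid_rate * dt
        t += dt
    return total


def depleting_rate(c: float):
    """Hypothetical attachment rate [beta](t) = c / (1 + c t)."""
    return lambda t: c / (1.0 + c * t)
```

For this profile the survival probability is $1/(1 + ct)$, so the truncated integral equals $\ln(1 + cH)/c$ for horizon $H$ and grows without bound as $H$ increases: the attachment eventually happens with probability 1, yet its expected time is infinite.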

We note in particular that our construction of Theorem 3.1 is composed of (j^j^) 2 different 
types of $O(\log n) \times O(\log n)$ "blocks" that can each grow via only one reaction. At least one of 
these blocks /3 must obey [/?] (t) < lo ^ n for all t G R + . This implies that the rate of the slowest 
such reaction is at most n . Thus our square construction assembles in at least "a ) time, 
slower than the optimal seeded time of $O(n)$ [3]. Proving this formally requires more details that 
we omit, but the fact that the system is a partial order system as defined in Section 5.1 means that 
Theorem 5.2 implies a time complexity lower bound of $\Omega(n)$. 

14 For our lower bound result, Theorem 5.2, it will not matter which tile type is selected as the timekeeper, except 
in the following sense. We define partial order systems, the class of directed hierarchical TAS's to which the bound 
applies, also with respect to a particular tile type in the unique terminal assembly. A TAS may be a partial order 
system with respect to one tile type but not another, but for all tile types s for which the TAS is a partial order 
system, the time lower bound of Theorem 5.2 applies when s is selected as the timekeeper. The upper bound of 
Theorem 6.1 holds with respect to only a single tile type, which will appear at a unique position in the final assembly. 

15 That is, for the purpose of determining the continuous dynamic evolution of the concentration of assemblies, 
including $\alpha$, in solution at time $t$, the rate of the reaction $\alpha + \beta \to \gamma$ at time $t$ is assumed to be proportional to 
$[\alpha](t)[\beta](t)$ (or half this value if the reaction is symmetric). However, for the purpose of determining the stochastic 
dynamic evolution of one particular copy of $s$, the rate of this reaction at time $t$ is assumed to be proportional only 
to $[\beta](t)$. This is because we want to describe the rate at which this particular copy of $\alpha$, the one containing the copy 
of $s$ that we fixed at time 0, encounters assemblies of type $\beta$. This instantaneous rate is independent of the number 
of other copies of $\alpha$ at time $t$ (although after $\epsilon$ seconds the rate will change to $[\beta](t + \epsilon)$, which of course will depend 
on $[\alpha]$ over that time interval). 
5 Time Complexity Lower Bound for Hierarchical Partial Order Systems 

In this section we show that the subset of hierarchical TAS's known as partial order systems cannot 
assemble any shape of diameter $N$ in less than $\Omega(N)$ time. 

5.1 Definition of Hierarchical Partial Order Systems 

Seeded partial order systems were first defined by Adleman, Cheng, Goel, and Huang [3] for the 
purpose of analyzing the running time of their optimal square construction. Intuitively, a seeded 
directed TAS with unique terminal assembly $\hat\alpha$ is a partial order system if every pair of adjacent 
positions $p_1$ and $p_2$ in $\hat\alpha$ that interact with positive strength have the property that either $p_1$ 
always receives a tile before $p_2$, or vice versa. We extend the definition of partial order systems to 
hierarchical systems in the following way. 

Let $\mathcal{T} = (T, \tau)$ be a hierarchical directed TAS with unique terminal assembly $\hat\alpha \in \mathcal{A}_\Box[\mathcal{T}]$. Let 
$\Upsilon$ be any terminal assembly tree of $\mathcal{T}$. Let $p \in \mathrm{dom}\ \hat\alpha$ and let $s = \hat\alpha(p)$. The assembly sequence 
with respect to $\Upsilon$ starting at $p$ is the sequence of assemblies $\vec\alpha_{p,\Upsilon} = (\alpha_1, \ldots, \alpha_k)$ that represent the 
path from the leaf corresponding to $p$ to the root of $\Upsilon$, so that $\alpha_1$ is the single tile $s$ at position 
$p$, and $\alpha_k = \hat\alpha$.¹⁶ An assembly sequence starting at $p$ is an assembly sequence with respect to $\Upsilon$ 
starting at $p$, for some valid assembly tree $\Upsilon$. 

An attachment quasiorder with respect to $p \in \mathrm{dom}\ \hat\alpha$ is a quasiorder (a reflexive, transitive 
relation) $\preceq$ on $\mathrm{dom}\ \hat\alpha$ such that the following holds: 

1. For every $p_1, p_2 \in \mathrm{dom}\ \hat\alpha$, $p_1 \preceq p_2$ if and only if for every assembly sequence $\vec\alpha = 
(\alpha_1, \ldots, \alpha_k)$ starting at $p$, for all $1 \le i \le k$, ($\alpha_i(p_2)$ is defined $\implies$ $\alpha_i(p_1)$ is defined). In 
other words, $p_1$ must always have a tile by the time $p_2$ has a tile. (Perhaps they always arrive 
at the same time in every assembly sequence starting at $p$.) 

2. For every pair of adjacent positions $p_1, p_2 \in \mathrm{dom}\ \hat\alpha$, if the tiles at positions $p_1$ and $p_2$ interact 
with positive strength in $\hat\alpha$, then $p_1 \preceq p_2$ or $p_2 \preceq p_1$ (or both). 
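
Because condition 1 is an "if and only if", the relation is fully determined by the set of assembly sequences, and only condition 2 needs checking. The Python sketch below makes this explicit for a finite, explicitly listed set of assembly sequences, each given as the list of domains $\mathrm{dom}\ \alpha_1, \ldots, \mathrm{dom}\ \alpha_k$. Supplying that list, and the list of positively interacting adjacent pairs, is an assumption of the sketch; enumerating them for a real tile system is a separate problem, and the function names are illustrative.

```python
from typing import FrozenSet, Iterable, Sequence, Tuple

Position = Tuple[int, int]
DomainSequence = Sequence[FrozenSet[Position]]  # dom(alpha_1), ..., dom(alpha_k)


def precedes(p1: Position, p2: Position, sequences: Iterable[DomainSequence]) -> bool:
    """Condition 1: in every assembly of every sequence, p2 present implies p1 present."""
    return all(p1 in dom for seq in sequences for dom in seq if p2 in dom)


def satisfies_condition_2(
    bonded_pairs: Iterable[Tuple[Position, Position]],
    sequences: Sequence[DomainSequence],
) -> bool:
    """Condition 2: every positively interacting adjacent pair is comparable."""
    return all(
        precedes(a, b, sequences) or precedes(b, a, sequences)
        for a, b in bonded_pairs
    )
```

The relation computed by `precedes` is automatically reflexive and transitive, since it is defined by an implication that holds sequence by sequence.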

If two tiles always arrive at the same time to the assembly containing $p$, then they will be in the 
same equivalence class induced by $\preceq$. Given an attachment quasiorder $\preceq$, we define the attachment 
partial order $\prec$ induced by $\preceq$ to be the strict partial order on the quotient set of equivalence 
classes induced by $\preceq$. In other words, if some subassembly $\alpha \sqsubseteq \hat\alpha$ always attaches to the assembly 
containing $p$ all at once, then all positions $p' \in \mathrm{dom}\ \alpha$ will be equivalent under $\preceq$.¹⁷ It is these 
equivalence classes of positions that are related under $\prec$. Note that the assembly $\alpha_1$ containing $p$ 
is always size 1, since by definition $p$ is the only position with a tile at time 0. 

We say that a directed hierarchical TAS $\mathcal{T}$ with unique terminal assembly $\hat\alpha$ is a hierarchical 
partial order system with respect to $p$ if it has an attachment quasiorder with respect to $p$. Each 
attachment partial order $\prec$ induces a directed acyclic graph $G = (V, E)$, where $V = \{\beta_1, \ldots, \beta_k\}$, 
each $\beta_i$ represents the subassembly corresponding to some equivalence class (under $\preceq$) of positions 
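16 That is, $\vec\alpha$ is like a seeded assembly sequence in that each $\alpha_i$ is a subassembly of $\alpha_{i+1}$ (written $\alpha_i \sqsubseteq \alpha_{i+1}$, meaning 
$\mathrm{dom}\ \alpha_i \subseteq \mathrm{dom}\ \alpha_{i+1}$ and $\alpha_i(p) = \alpha_{i+1}(p)$ for all $p \in \mathrm{dom}\ \alpha_i$). The difference is that $\alpha_i$ and $\alpha_{i+1}$ may differ in size by 
more than one tile, since $\mathrm{dom}\ \alpha_{i+1} \setminus \mathrm{dom}\ \alpha_i$ will consist of all points in the domain of $\alpha_i$'s sibling in $\Upsilon$. 

17 More generally, if there is a subset $X \subseteq \mathrm{dom}\ \hat\alpha$ such that all assemblies $\alpha$ attaching to the assembly containing 
$p$ have the property that $\mathrm{dom}\ \alpha \cap X \ne \emptyset \implies X \subseteq \mathrm{dom}\ \alpha$. 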
in $\mathrm{dom}\ \hat\alpha$, and $(\beta_i, \beta_j) \in E$ if $\mathrm{dom}\ \beta_i \prec \mathrm{dom}\ \beta_j$. Given a tile type $s \in T$, we say that $\mathcal{T}$ is a 
hierarchical partial order system with respect to $s$ if, for all $p \in \mathrm{dom}\ \hat\alpha$ such that $\hat\alpha(p) = s$, $\mathcal{T}$ is a 
hierarchical partial order system with respect to $p$. 

In the case of seeded assembly, in which each attachment is of a "subassembly" containing a 
single tile to a subassembly containing the seed, this definition of partial order system is equivalent 
to the definition of partial order system given in [3]. In the seeded case, since no tiles may attach 
simultaneously, the attachment quasiorder we have defined induces equivalence classes that are 
singletons, and the resulting induced strict partial order is the same as the strict partial order used 
in [3]. It is routine to check that the TAS described in Section 3 is a partial order system, using 
one of the orange tiles in the upper-left of Figure 3 as a "seed." 

5.2 Linear Time Lower Bound for Partial Order Systems 

Theorem 5.2 establishes that hierarchical partial order systems, like their seeded counterparts,
cannot assemble a shape of diameter $N$ in less than $\Omega(N)$ time. Intuitively, this is proven by
using the fact that an attaching assembly of size $K$, which is able to increase the size of the
growing assembly by $K$ tiles in a single step, can have concentration at most $1/K$ by conservation
of mass. This slows down its rate of attachment (compared to the rate of a single tile) by a factor of at
least $K$, precisely enough to cancel out the potential speedup over a single tile due to its size. This
simplistic argument is not quite accurate and must be amortized, using our Conservation of Mass
Lemma (Lemma 5.1), over all assemblies that could extend the growing assembly: the growing
assembly may be extended at a single attachment site by more than one assembly, but the more
assemblies that could extend it, the less the total concentration of the assemblies that could attach
to create the extension. Intuitively, the property of having a partial order on binding subassemblies
ensures that the assembly of each path in the partial order graph proceeds by a series of rate-limiting
steps. We prove upper bounds on each of these rates using this concentration argument. 18
Since the rate-limiting steps must occur in order, we can then use linearity of expectation to bound
the total expected time.
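The cancellation between block size and attachment rate can be illustrated numerically. The following Python sketch is not part of the original analysis. It assumes a deliberately simplified path in which every step has exactly one candidate block of $K$ tiles, present at the largest concentration $1/K$ that conservation of mass allows, so that its attachment time is exponential with rate $1/K$. The names `expected_path_time`, `simulate_path_time`, and `block_sizes` are hypothetical.

```python
import random


def expected_path_time(block_sizes):
    """Expected time for the blocks to attach one after another.

    Assumption (illustrative only): a block of K tiles attaches at rate 1/K,
    so its expected wait is K. By linearity of expectation the waits add.
    """
    return float(sum(block_sizes))


def simulate_path_time(block_sizes, trials=20000, seed=0):
    """Monte Carlo estimate of the same expected time."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        # Each block waits an exponential time with rate 1/K (mean K).
        total += sum(rng.expovariate(1.0 / k) for k in block_sizes)
    return total / trials
```

Under these assumptions, blocks of sizes 1, 4, and 16 add 21 tiles to the path in expected time 21, which is one tile per unit time: no faster than single-tile attachment at unit concentration.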

The following is a "conservation of mass lemma" that will be helpful in the proof of Theorem 5.2. 
Note that it applies to any hierarchical system. 

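Lemma 5.1 (Conservation of Mass Lemma). Let $\mathcal{T} = (T, \tau)$ be a hierarchical TAS and let
$C : T \to [0, 1]$ be a concentrations function. Then for all $t \in \mathbb{R}^+$,

$$\sum_{\alpha \in \mathcal{A}[\mathcal{T}]} [\alpha](t) \cdot |\alpha| = \sum_{r \in T} C(r) \quad (\leq 1).$$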
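As a sanity check of the lemma, the following Python sketch integrates a toy reaction network with forward Euler steps and compares the total mass before and after. It is only an illustration under stated assumptions: the three reactions and the species names are invented for the example, all rate constants are taken to be 1, and a symmetric reaction $\alpha + \alpha \to \gamma$ is assumed to consume $\alpha$ at rate $[\alpha]^2$ while producing $\gamma$ at rate $\frac{1}{2}[\alpha]^2$.

```python
def simulate_mass_action(conc, reactions, dt=0.01, steps=1000):
    """Forward-Euler integration of unit-rate attachment reactions.

    conc: dict mapping assembly name -> concentration.
    reactions: list of (reactant1, reactant2, product) names.
    """
    conc = dict(conc)
    for _ in range(steps):
        change = {name: 0.0 for name in conc}
        for a, b, c in reactions:
            if a == b:
                # Symmetric reaction a + a -> c (assumed convention, see lead-in).
                rate = conc[a] ** 2
                change[a] -= rate
                change[c] += 0.5 * rate
            else:
                # Asymmetric reaction a + b -> c at rate [a][b].
                rate = conc[a] * conc[b]
                change[a] -= rate
                change[b] -= rate
                change[c] += rate
        for name in conc:
            conc[name] += dt * change[name]
    return conc


def total_mass(conc, sizes):
    """Sum over assemblies of concentration times number of tiles."""
    return sum(conc[name] * sizes[name] for name in conc)


# Hypothetical toy system: two monomers, two dimers, one 4-tile assembly.
SIZES = {"a": 1, "b": 1, "aa": 2, "ab": 2, "aaab": 4}
REACTIONS = [("a", "a", "aa"), ("a", "b", "ab"), ("aa", "ab", "aaab")]
INITIAL = {"a": 0.5, "b": 0.3, "aa": 0.0, "ab": 0.0, "aaab": 0.0}
```

Calling `simulate_mass_action(INITIAL, REACTIONS)` and evaluating `total_mass` on the result should reproduce the initial mass up to floating-point rounding, because every reaction removes exactly as many tiles as it adds.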

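Proof. For all $t \in \mathbb{R}^+$, define $f(t) = \sum_{\alpha \in \mathcal{A}[\mathcal{T}]} [\alpha](t) \cdot |\alpha|$. According to our model, $[\alpha](0) = C(r)$
if $\alpha$ consists of a single tile type $r$ and $[\alpha](0) = 0$ otherwise, so $f(0) = \sum_{r \in T} C(r)$. Therefore it
is sufficient (and necessary) to show that $\frac{df}{dt} = 0$. For all $\alpha \in \mathcal{A}[\mathcal{T}]$ and $t \in \mathbb{R}^+$, define $f_\alpha(t) =$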

18 The same assembly $\alpha$ could attach to many locations $p_1, \ldots, p_n$. In a TAS that is not a partial order system, it
could be the case that there is not a fixed attachment location that is necessarily required to complete the assembly.
In this case completion of the assembly might be possible even if only one of $p_1, \ldots, p_n$ receives the attachment of
$\alpha$. Since the minimum of $n$ exponential random variables with rate $1/K$ is itself exponential with rate $n/K$, the
very first attachment of $\alpha$ to any of $p_1, \ldots, p_n$ happens in expected time $K/n$, as opposed to expected time $K$ for $\alpha$
to attach to a particular $p_i$. This prevents our technique from applying to such systems, and it is the fundamental
speedup technique in our proof of Theorem 6.1.

$[\alpha](t) \cdot |\alpha|$. Then by equation (4.1), and recalling from that equation the definitions of $m$, $n$, $p$, $\beta'_i$,
$\beta''_i$, $\gamma_i$, and $\beta_j$, annotated as $m(\alpha)$, $n(\alpha)$, etc. to show their dependence on $\alpha$, we have

Let $\mathcal{R}$ denote the set of all attachment reactions of $\mathcal{T}$, writing $R(\alpha, \beta, \gamma)$ to denote the reaction
$\alpha + \beta \to \gamma$. For each such reaction, $|\alpha| + |\beta| = |\gamma|$. In particular, if $\alpha = \beta$, then $|\gamma| = 2|\alpha|$. Each
asymmetric reaction ($\alpha \neq \beta$) contributes precisely three unique terms in the right hand side above:
two negative (of the form $-|\alpha| \cdot [\alpha](t) \cdot [\beta](t)$ and $-|\beta| \cdot [\alpha](t) \cdot [\beta](t)$) and one positive (of the form
$|\gamma| \cdot [\alpha](t) \cdot [\beta](t)$). Each symmetric reaction ($\alpha = \beta$) contributes two unique terms: one negative (of
the form $-|\alpha| \cdot [\alpha](t)^2$) and one positive (of the form $|\gamma| \cdot \frac{1}{2} \cdot [\alpha](t)^2$).
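The cancellation can be made explicit by grouping the terms reaction by reaction. The display below is a reconstruction from the term forms just listed (assuming unit rate constants, as in those terms), not a quotation of the original equation:

```latex
\frac{df}{dt}
  = \sum_{\substack{R(\alpha,\beta,\gamma) \in \mathcal{R} \\ \alpha \neq \beta}}
      \bigl(|\gamma| - |\alpha| - |\beta|\bigr)\,[\alpha](t)\,[\beta](t)
  \;+\; \sum_{R(\alpha,\alpha,\gamma) \in \mathcal{R}}
      \Bigl(\tfrac{1}{2}|\gamma| - |\alpha|\Bigr)\,[\alpha](t)^2
  \;=\; 0
```

Every coefficient vanishes, since $|\gamma| = |\alpha| + |\beta|$ in the first sum and $|\gamma| = 2|\alpha|$ in the second.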



Then we may rewrite the above sum as

Theorem 5.2. Let $\mathcal{T} = (T, \tau)$ be a hierarchical partial order system with respect to $s \in T$, with
unique terminal assembly $\widehat{\alpha}$ of $L_1$ diameter $N$. Then for all concentration functions $C : T \to [0, 1]$,
$\mathsf{T}(\mathcal{T}, C, s) = \Omega(N)$.

Proof. Let $\widehat{\alpha} \in \mathcal{A}_\Box[\mathcal{T}]$ be the unique terminal assembly of $\mathcal{T}$, and let $p \in \mathrm{dom}\,\widehat{\alpha}$ be such that
$\widehat{\alpha}(p) = s$. Let $\preceq$ be the attachment quasiorder testifying to the fact that $\mathcal{T}$ is a partial order
system with respect to $p$. Let $\prec$ be the strict partial order induced by $\preceq$. Let $G = (V, E)$ be the
directed acyclic graph induced by $\prec$. Assign weights to the edges of $E$ by $w(\alpha_i, \alpha_j) = |\alpha_j|$.

If $P' = (\alpha'_1, \ldots, \alpha'_l)$ is any path in $G$, define the (weighted) length of $P'$ to be $w(P') =
\sum_{i=1}^{l-1} w(\alpha'_i, \alpha'_{i+1})$. Let $q, r \in \mathrm{dom}\,\widehat{\alpha}$ be two points at $L_1$ distance $N$, which must exist since
the diameter of $\mathrm{dom}\,\widehat{\alpha}$ is $N$. By the triangle inequality, at least one of $q$ and $r$ is at distance at least
$N/2$ from $p$; without loss of generality, the distance $d$ from $p$ to $q$ in $\mathrm{dom}\,\widehat{\alpha}$ is
at least $N/2$. Let $\alpha_n \in V$ be such that $q \in \mathrm{dom}\,\alpha_n$, and let $P = (\alpha_1, \ldots, \alpha_n)$ be any path in $G$ starting
with $\alpha_1$ and ending with $\alpha_n$, where $\alpha_1$ is the assembly consisting just of the tile at position $p$. The
weight of each edge in $P$ is an upper bound on the diameter of the assembly it represents, and
the sum of these diameters for all $\alpha_i$ in $P$ is itself an upper bound on the distance from $p$ to $q$.
Therefore $w(P) \geq N/2$.

Let $\mathbf{T}_P$ be the random variable representing the time taken for the complete path $P$ to form.
Since the tiles on $P$ represent a subassembly of $\widehat{\alpha}$, $\widehat{\alpha}$ cannot completely form until the path $P$ forms.
Therefore $\mathbf{T}_P \leq \mathbf{T}_{\mathcal{T},C,p}$. Since $\mathsf{T}(\mathcal{T}, C, p) = E[\mathbf{T}_{\mathcal{T},C,p}]$, it suffices to show that $E[\mathbf{T}_P] = \Omega(w(P))$.

Because of the precedence relationship described by $\prec$, no portion of the path $P$ can form until
its immediate predecessor on $P$ is present. After some amount of time, some prefix $P'$ of the path
$P$ has assembled (possibly with some other portions of $\widehat{\alpha}$ not on the path $P$). Given $t \in \mathbb{R}^+$, let
$L(t)$ be the random variable indicating the weighted length of this prefix after $t$ units of time.

We claim that for all $t \in \mathbb{R}^+$, $E[L(t)] \leq t$. Define the function $f : \mathbb{R}^+ \to \mathbb{R}^+$ for all $t \in \mathbb{R}^+$ by
$f(t) = E[L(t)]$, noting that $f(0) = 0$. Let $f' = \frac{df}{dt}$. Let $P' = (\alpha_1, \ldots, \alpha_m)$ be the prefix of $P$ formed
after $t$ seconds. Let $\beta_1, \beta_2, \ldots, \beta_k$, with $m + k = n$, be the individual subassemblies remaining on
the path, in order, so that $P = (\alpha_1, \ldots, \alpha_m, \beta_1, \ldots, \beta_k)$. For all $1 \leq i \leq k$, let $\gamma_i = \bigcup_{j=1}^{i} \beta_j$ be the
union of the next $i$ such subassemblies on the path (representing each of the amounts by which the prefix
could grow in the next attachment event). Let $s_i = |\gamma_i|$ be the size of the $i$th such subassembly, and let
$c_i = \sum_{\alpha \in A_i(t)} [\alpha](t)$, where $A_i(t)$ is the set of subassemblies (possibly containing tiles not on the
path $P$) at time $t$ that contain $\gamma_i$ but do not contain $\gamma_{i+1}$. $A_i(t)$ represents the set of all assemblies
that could attach to grow the prefix by exactly the tiles in $\gamma_i$. Our argument uses the conservation of mass
property (Lemma 5.1) to show that no matter the concentration of assemblies in each $A_i(t)$, the
rate of growth is at most one tile per unit of time.

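For each $1 \leq i \leq k$, in the next instant $dt$, with probability $c_i\,dt$ the prefix will extend by
total weighted length $s_i$ by attachment of (a superassembly containing) $\gamma_i$. This implies that
$f'(t) = \sum_{i=1}^{k} c_i \cdot s_i$. Invoking Lemma 5.1, it follows that for all $t \in \mathbb{R}^+$,

$$f'(t) = \sum_{i=1}^{k} c_i \cdot s_i = \sum_{i=1}^{k} \sum_{\alpha \in A_i(t)} [\alpha](t) \cdot |\gamma_i| \leq \sum_{\alpha \in \mathcal{A}[\mathcal{T}]} [\alpha](t) \cdot |\alpha| \leq 1,$$

where the first inequality holds because the sets $A_i(t)$ are pairwise disjoint and every $\alpha \in A_i(t)$
contains $\gamma_i$, so that $|\gamma_i| \leq |\alpha|$.

Since $f(0) = 0$, this implies that $f(t) \leq t$ for all $t \in \mathbb{R}^+$, which completes the proof of the claim
that $E[L(t)] \leq t$.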

Since $E[L(t)] \leq t$, by Markov's inequality, $\Pr[L(t) \geq 2t] \leq \frac{1}{2}$. Letting $t = w(P)/2$, the event
$L(w(P)/2) \geq w(P)$ is equivalent to the event $\mathbf{T}_P \leq w(P)/2$. Thus $\Pr[\mathbf{T}_P \leq w(P)/2] \leq \frac{1}{2}$. By
Markov's inequality, $E[\mathbf{T}_P] \geq w(P)/4 = \Omega(w(P)) = \Omega(N)$, which proves the theorem. □
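The final application of Markov's inequality can be spelled out as follows. This is a routine expansion that uses only the nonnegativity of the path-formation time and the bound $w(P) \geq N/2$ established earlier in the proof:

```latex
E[\mathbf{T}_P]
  \;\geq\; \frac{w(P)}{2} \cdot \Pr\!\left[\mathbf{T}_P > \frac{w(P)}{2}\right]
  \;\geq\; \frac{w(P)}{2} \cdot \frac{1}{2}
  \;=\; \frac{w(P)}{4}
  \;\geq\; \frac{N}{8}
```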

Although our assembly time model describes concentrations as evolving according to the standard
mass-action kinetic differential equations, Lemma 5.1 is the only property of this model that
is required for our proof of Theorem 5.2. Even if concentrations of attachable assemblies (and thus
their associated attachment rates in the Markov process defining assembly time) could be magically
adjusted throughout the assembly process so as to minimize the assembly time, so long as the
concentrations obey Lemma 5.1 at all times, Theorem 5.2 still holds.

6 Assembly of a Shape in Time Sublinear in its Diameter



This section is devoted to proving the following theorem, which shows that relaxing the partial 
order assumption on hierarchical tile systems allows for assembly time sublinear in the diameter of 
the shape. 

Theorem 6.1. For infinitely many $n \in \mathbb{N}$, there is a (non-directed) hierarchical TAS $\mathcal{T} = (T, 2)$
that strictly self-assembles an $n \times n'$ rectangle, where $n > n'$, such that $|T| = O(\log n)$ and there is
a tile type $s \in T$ and concentrations function $C : T \to [0, 1]$ such that $\mathsf{T}(\mathcal{T}, C, s) = O(n^{4/5} \log n)$.

In our proof, $n' \approx n^{3/5}$, but we care only that $n' < n$ so that the diameter of the shape is $\Theta(n)$.
As discussed in Section 4, we interpret the upper bound of Theorem 6.1 more cautiously than 
the lower bound of Theorem 5.2, since some of our simplifying assumptions concerning diffusion 
rates and binding strength thresholds, discussed in Section 7, may cause the assembly time to 
appear artificially faster in our model than in reality. A reasonable conclusion to be drawn from 
Theorem 6.1 is that concentration arguments alone do not suffice to show a linear-time lower bound 
on assembly time in the hierarchical model. 

Although we use mass-action kinetics to model changing concentrations, we occasionally use
discrete language to describe the intuition behind reactions (e.g., "a copy of A is consumed and
two copies of B are produced"), despite the fact that mass-action kinetics models continuously evolving
real-valued concentrations.

The hierarchical model will permit a speed-up over the seeded model. However, when viewed 
as a programming language for tile assembly, the hierarchical model is more unwieldy to program 
and to analyze. Therefore we prove a number of lemmas showing that careful design of hierarchical 
tiles will cause them to "behave enough like" seeded tiles to remain tractable for analysis, and 
to ensure that the assembly proceeds sufficiently quickly. Much like parallel programming, in 
which critical regions are segregated into a few well-characterized parts of the program, we largely 
employ "seeded-like assembly" for most subcomponents of the construction, combining them using 
hierarchical parallelism at a small number of well-understood points. 

6.1 Warm-Up: A Thin Bar 

We first "warm up" by analyzing in detail the assembly time of a simple but non-trivial system. 
The following lemma, Lemma 6.2 (more precisely, its corollary, Corollary 6.3 that assigns concrete 
concentrations to the tile types), shows that it is possible to grow a "substantial" concentration 
of a "hard-coded thin bar" in quadratic time under the mass-action model. This structure will 
be the first subassembly formed in many other subcomponents of the tile system of Theorem 6.1. 
Furthermore, the proof of Lemma 6.2 will illustrate several techniques for analyzing hierarchical 
assembly time (and "programming tricks" to ensure that this time is fast). To achieve quadratic
time we use "polyomino-safe" tiles that grow a $2 \times n$ bar in a zig-zag fashion to enforce that
no substantial growth nucleates except at the "seed", similar to the zig-zag tile set described by
Schulman and Winfree [40, 41] (which prevented spurious nucleation with high probability under
the more permissive kinetic tile assembly model [50] that allows reversible attachments and strength-1
attachments). This enforces that no large overlapping subassemblies grow that would compete
to consume tiles without being able to attach to each other (which happens using the naive $n$ tile
types required to grow a $1 \times n$ bar).



Lemma 6.2. Let $\mathcal{T} = (T, 2)$ be the hierarchical TAS shown in Figure 4, and let $\widehat{\alpha}$ be its unique
terminal assembly of a $2 \times n$ bar. Let the initial concentrations be defined by $[s_1](0) = \delta > 0$,
$[r_i](0) = 2\delta$ for all $i \in \{1, \ldots, n\}$, and $[s_i](0) = 3\delta$ for all $i \in \{2, \ldots, n\}$. Then for all $t \geq \frac{4n}{\delta}$,
$[\widehat{\alpha}](t) \geq \frac{\delta}{2}$.

Figure 4: Tile types that grow a $2 \times n$ bar. Thick lines represent strength-2 glues, and thin lines represent
strength-1 glues. Glues are not labeled, but each is hard-coded to represent its position in the final assembly.
Zig-zag growth enforces that no overlapping subassemblies larger than size 2 can occur. All growth other
than size-2 dimers (which will attach just as correctly as if they had stayed monomers) must nucleate from
the "seed" labeled $s_1$.

Intuitively, the reason for the choice of concentrations is to approximate the speed of seeded
single-tile addition assembly, by enforcing that the concentrations of individual tiles (or dimers)
other than $s_1$ remain for all time at least a fixed constant $\delta$, which keeps high their rate of reaction
with a larger assembly containing "preceding" tile types.

Proof. There are two types of producible assemblies containing $s_1$: $\alpha_i$, for $i \in \{1, \ldots, n\}$, which has
exactly $2i$ tiles, and $\beta_i$, for $i \in \{1, \ldots, n\}$, which has exactly $2i - 1$ tiles. Hence $\beta_1$ contains only $s_1$,
and $\alpha_n = \widehat{\alpha}$.

For all $i \in \{1, \ldots, n-1\}$, let $d_i$ denote the dimer (2-tile assembly) consisting of just $r_i$ and $s_{i+1}$.
Since $\beta_i$ contains $s_1$, for all $t \in \mathbb{R}^+$, $[\beta_i](t) \leq [s_1](0) = \delta$. Thus at most $\delta$ of the $r_i$'s can bind
to $\beta_i$. The remainder must stay unbound or bind to $s_{i+1}$ to form $d_i$. Since $[r_i](0) = 2\delta$, for all
$i \in \{1, \ldots, n-1\}$ and all $t \in \mathbb{R}^+$,

$$[d_i](t) + [r_i](t) \geq \delta. \qquad (6.1)$$

Also note that since there is no $d_n$, we have

$$[r_n](t) \geq \delta \qquad (6.2)$$

for all $t \in \mathbb{R}^+$ by the same reasoning. By similar reasoning, for $i \in \{2, \ldots, n\}$, since $[s_i](0) = 3\delta$,
no more than $\delta$ of $s_i$ can bind to $\alpha_{i-1}$ to become part of $\beta_i$, and no more than $\delta$ of the remaining
can bind to copies of $r_{i-1}$ that never attach to $\beta_{i-1}$. Thus for all $i \in \{2, \ldots, n\}$ and all $t \in \mathbb{R}^+$,

$$[s_i](t) \geq \delta. \qquad (6.3)$$

19 Assembling a $1 \times n$ bar provably requires $\Omega(n \log n)$ time to reach half of its steady-state concentration [4],
even when all $n$ tile types are allowed to have concentration as high as 1, exceeding the bound of the finite density
constraint. Enforcing the finite density constraint and assigning each tile type a concentration of $1/n$ gives a time
bound of $\Omega(n^2 \log n)$.

Let

$$S(t) = \sum_{i=1}^{n} |\alpha_i| \cdot [\alpha_i](t) + \sum_{i=1}^{n} |\beta_i| \cdot [\beta_i](t) = \sum_{i=1}^{n} 2i \, [\alpha_i](t) + \sum_{i=1}^{n} (2i - 1) \, [\beta_i](t).$$

$S(t)$ can be thought of as the total "mass" of tiles that belong to an assembly containing $s_1$ at time
$t$. Observe that

$$\sup_{t} S(t) = 2n\delta, \qquad (6.4)$$

with the supremum $2n\delta$ attained only in the limit as $t \to \infty$, when all $s_1$ belong to an assembly $\alpha_n$.
The reactions $r_i + s_{i+1} \to d_i$ do not change $S(t)$. All other reactions (which are precisely those of
the seeded model) increase $S(t)$. Each reaction that increases $S(t)$ is of the form $\alpha_i + s_{i+1} \to \beta_{i+1}$,
$\beta_i + r_i \to \alpha_i$ (each of which increases $S(t)$ by 1 for each unit concentration of the product produced),
or $\beta_i + d_i \to \beta_{i+1}$ (which increases $S(t)$ by 2). Therefore, summing the propensities of all these
reactions, we obtain

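$$\frac{dS(t)}{dt} = \sum_{i=1}^{n-1} [\alpha_i](t) \cdot [s_{i+1}](t) + \sum_{i=1}^{n} [\beta_i](t) \cdot [r_i](t) + \sum_{i=1}^{n-1} 2 \cdot [\beta_i](t) \cdot [d_i](t).$$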

By (6.3), for all $t \in \mathbb{R}^+$, the propensity of reaction $(\alpha_i + s_{i+1} \to \beta_{i+1})$ is at least $\delta[\alpha_i](t)$ for all
$i \in \{1, \ldots, n-1\}$. By (6.1), for all $t \in \mathbb{R}^+$, the propensity of reaction $(\beta_i + r_i \to \alpha_i)$ plus the
propensity of reaction $(\beta_i + d_i \to \beta_{i+1})$ is at least $\delta[\beta_i](t)$ for all $i \in \{1, \ldots, n-1\}$. By (6.2), for
all $t \in \mathbb{R}^+$, the propensity of reaction $(\beta_n + r_n \to \alpha_n)$ is at least $\delta[\beta_n](t)$.

Combining these bounds together, and noting that for all $t \in \mathbb{R}^+$, $[s_1](0) = \sum_{i=1}^{n} ([\alpha_i](t) + [\beta_i](t))$
(each right-hand side term representing an assembly that could contain a copy of $s_1$), we obtain



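$$\begin{aligned}
\frac{dS(t)}{dt} &= [\beta_n](t) \cdot [r_n](t) + \sum_{i=1}^{n-1} [\alpha_i](t) \cdot [s_{i+1}](t) + \sum_{i=1}^{n-1} [\beta_i](t) \cdot \bigl([r_i](t) + 2 \cdot [d_i](t)\bigr) \\
&\geq \delta \left( [\beta_n](t) + \sum_{i=1}^{n-1} \bigl([\alpha_i](t) + [\beta_i](t)\bigr) \right) \\
&= \delta \bigl([s_1](0) - [\alpha_n](t)\bigr) = \delta^2 - \delta[\alpha_n](t). \qquad (6.5)
\end{aligned}$$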

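Since $\alpha_n$ is not a reactant in any reaction, $[\alpha_n](t)$ is monotonically increasing. Thus it suffices to
prove that $[\alpha_n](\frac{4n}{\delta}) \geq \frac{\delta}{2}$. Suppose for the sake of contradiction that $[\alpha_n](\frac{4n}{\delta}) < \frac{\delta}{2}$. Then $[\alpha_n](t) < \frac{\delta}{2}$
for all $t \leq \frac{4n}{\delta}$ by the monotonicity of $[\alpha_n](t)$. By this bound and (6.5), $\frac{dS(t)}{dt} > \frac{\delta^2}{2}$ for all $t \leq \frac{4n}{\delta}$.
Since $S(0) = \delta$, this means that $S(\frac{4n}{\delta}) > \delta + \frac{\delta^2}{2} \cdot \frac{4n}{\delta} = (2n + 1)\delta$, which contradicts (6.4). □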
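The bound of Lemma 6.2 can be checked numerically on a small instance. The following Python sketch integrates the four reaction families named in the proof with forward Euler steps. It is an illustration only: the function name `simulate_bar`, the step size, and the use of unit rate constants are assumptions made for the example, and a numerical run is of course not a substitute for the proof.

```python
def simulate_bar(n, delta, t_end, dt=0.05):
    """Forward-Euler sketch of the mass-action kinetics of the 2 x n bar.

    Index convention (1-based, slot 0 unused): r[i], s[i], d[i] are free
    tiles and dimers; alpha[i] has 2i tiles and beta[i] has 2i - 1 tiles.
    The seed tile s_1 is the assembly beta[1]. Returns [alpha_n](t_end).
    """
    r = [0.0] + [2 * delta] * n
    s = [0.0, 0.0] + [3 * delta] * (n - 1)
    d = [0.0] * (n + 1)
    alpha = [0.0] * (n + 1)
    beta = [0.0] * (n + 1)
    beta[1] = delta
    for _ in range(int(round(t_end / dt))):
        dr = [0.0] * (n + 1)
        ds = [0.0] * (n + 1)
        dd = [0.0] * (n + 1)
        da = [0.0] * (n + 1)
        db = [0.0] * (n + 1)
        for i in range(1, n + 1):
            x = beta[i] * r[i]  # beta_i + r_i -> alpha_i
            db[i] -= x
            dr[i] -= x
            da[i] += x
            if i < n:
                y = alpha[i] * s[i + 1]  # alpha_i + s_{i+1} -> beta_{i+1}
                da[i] -= y
                ds[i + 1] -= y
                db[i + 1] += y
                z = r[i] * s[i + 1]  # r_i + s_{i+1} -> d_i
                dr[i] -= z
                ds[i + 1] -= z
                dd[i] += z
                w = beta[i] * d[i]  # beta_i + d_i -> beta_{i+1}
                db[i] -= w
                dd[i] -= w
                db[i + 1] += w
        for i in range(1, n + 1):
            r[i] += dt * dr[i]
            s[i] += dt * ds[i]
            d[i] += dt * dd[i]
            alpha[i] += dt * da[i]
            beta[i] += dt * db[i]
    return alpha[n]
```

For example, with $n = 4$ and $\delta = 0.05$, evaluating `simulate_bar(4, 0.05, 4 * 4 / 0.05)` gives the concentration of the completed bar at time $4n/\delta$, which the lemma predicts to be at least $\delta/2$.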

The following corollary shows that if we pick the initial concentrations to be maximal subject
to the finite density constraint and the constraints of Lemma 6.2, then quadratic time is sufficient
to obtain terminal assembly concentration that is at least inversely linear. By Lemma 5.1, any
producible assembly $\alpha$ obeys $[\alpha](t) \leq \frac{1}{|\alpha|}$ for all $t \in \mathbb{R}^+$, so this concentration bound is optimal
to within a constant factor. The time bound is asymptotically suboptimal 20 but sufficient for our
purposes, since we only use hard-coded thin bars that are logarithmically smaller than the final
assembly.

Corollary 6.3. Let $\mathcal{T} = (T, 2)$ be the hierarchical TAS shown in Figure 4, and let $\widehat{\alpha}$ be its unique
terminal assembly of a $2 \times n$ bar. Let the initial concentrations be defined by $[s_1](0) = \delta = \frac{1}{cn}$ for
some constant $c \in \mathbb{R}^+$, $[r_i](0) = 2\delta$ for all $i \in \{1, \ldots, n\}$, and $[s_i](0) = 3\delta$ for all $i \in \{2, \ldots, n\}$. 21
Then for all $t \geq 4cn^2$, $[\widehat{\alpha}](t) \geq \frac{1}{2cn}$.

The next lemma is a discrete version of Corollary 6.3, which shows that selecting $s_1$ as the
timekeeper results in quadratic-time assembly of the bar under our stochastic assembly time model.

Lemma 6.4. Let $\mathcal{T} = (T, 2)$ be the hierarchical TAS shown in Figure 4, and define a concentrations
function $C : T \to [0, 1]$ as in the statement of Corollary 6.3, setting $C(s_1) = \delta = \frac{1}{cn}$ for
some constant $c \in \mathbb{R}^+$, $C(r_i) = 2\delta$ for $i \in \{1, \ldots, n\}$, and $C(s_i) = 3\delta$ for $i \in \{2, \ldots, n\}$. Then
$\mathsf{T}(\mathcal{T}, C, s_1) \leq 2cn^2$.

Proof. For all $j \in \{1, \ldots, 2n-1\}$, let $t_j$ denote the expected time until the assembly containing $s_1$
grows by at least one tile, conditioned on the event that the current assembly is size at least $j$ but
less than $2n$. By (6.1), (6.2), and (6.3) and the model of Markov process transition rates we employ
to determine $\mathsf{T}(\mathcal{T}, C, s_1)$, it holds that $t_j \leq 1/\delta = cn$ for all $j \in \{1, \ldots, 2n-1\}$. By linearity of
expectation, $\mathsf{T}(\mathcal{T}, C, s_1) = E[\mathbf{T}_{\mathcal{T},C,s_1}] \leq \sum_{j=1}^{2n-1} t_j \leq (2n-1)cn < 2cn^2$. □
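The arithmetic behind Corollary 6.3 and Lemma 6.4 is easy to tabulate. The following Python sketch, with hypothetical helper names and tile labels, builds the concentrations used in the two statements with exact rational arithmetic and evaluates the bound $(2n-1)cn$ from the proof above. It only restates the counting; it does not simulate the Markov process.

```python
from fractions import Fraction


def corollary_concentrations(n, c):
    """Initial concentrations with delta = 1/(c*n), keyed by tile label."""
    delta = Fraction(1, c * n)
    conc = {"s1": delta}
    for i in range(1, n + 1):
        conc["r%d" % i] = 2 * delta
    for i in range(2, n + 1):
        conc["s%d" % i] = 3 * delta
    return conc


def expected_time_upper_bound(n, c):
    """(2n - 1) growth steps, each with expected wait at most 1/delta = c*n."""
    return (2 * n - 1) * c * n
```

For instance, with $n = 10$ and $c = 5$ the total concentration is $48/50$, which respects the finite density constraint, and the bound evaluates to $950 < 2cn^2 = 1000$.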

6.2 General Techniques for Bounding Assembly Time 

Techniques from the proofs of Lemmas 6.2 and 6.4 can be generalized in the following way to 
ease analysis of assembly time of well-behaved hierarchical systems. The results of this section 
will be our main technical tools used to bound the assembly time of the shape of Theorem 6.1. 
Intuitively, if the tile system is "polyomino-robust" (defined below), in the sense that the seeded 
and hierarchical models result in essentially the same producible assemblies and are well-behaved in 
other ways, then we can bound the hierarchical assembly time in terms of the size of the structure 
and the number of tile types needed to assemble it, if concentrations are set appropriately. 
The TAS of Figure 4 has the following useful properties: 

1) It is directed. 

2) There is a constant $q$ ($q = 2$ in Figure 4) such that producible assemblies not containing
the "seed" $s_1$ are of size at most $q$. We term such assemblies polyominos. For mathematical
convenience, we treat individual tile types that are not part of any polyomino as if they are
polyominos of size 1, and we call larger polyominos nontrivial polyominos.

3) The set of producible assemblies containing $s_1$ is precisely the same in the seeded model as in
the hierarchical model, and furthermore every terminal producible assembly contains $s_1$. (The

20 We assign concentrations of $\Theta(\frac{1}{n})$ to obey the finite density constraint since there are $O(n)$ distinct tile types.
However, only $O(\sqrt{n})$ tile types are required to assemble a $2 \times n$ bar [6, Theorem 3.2]. The concentrations of these
tile types could be set to $O(\frac{1}{\sqrt{n}})$, lowering the half-completion time from $O(n^2)$ to $O(n^{1.5})$. Furthermore, since we
later use the length-$n$ bar to encode $n$ bits, necessitating that each tile type on the top row be unique, we could not
use $O(\sqrt{n})$ tile types anyway.

21 We must choose $\delta \leq \frac{1}{5n-1}$ to obey the finite density constraint.

tile system is polyomino-safe, in the sense defined by Winfree [51].) In particular this implies
that $\mathcal{A}_\Box[\mathcal{T}] = \mathcal{A}_\Box[\mathcal{T}_s]$, where $\mathcal{T}_s = (T, \sigma, \tau)$ is the seeded version of $\mathcal{T} = (T, \tau)$ with $\sigma$ containing
only $s_1$.

4) The polyominos that attach to (an assembly containing) $s_1$ are a "total order (sub)system with
respect to assemblies containing the seed". More formally, define a maximal polyomino $\alpha$ to be
a polyomino that is not attachable to any assembly not containing $s_1$. For each maximal
polyomino $\alpha$, there is a strict total order $\prec$ on $\mathrm{dom}\,\alpha$ such that if $p_1 \prec p_2$, then the tile at
position $p_1$ always attaches to an assembly containing $s_1$ by at least the time that the tile at
position $p_2$ attaches. 22

5) Every tile type belongs to at most one type of maximal polyomino (which may appear in multiple
locations in the terminal assembly), and appears exactly once in the polyomino. Here we include
maximal polyominos of size 1, which means any tile type in a polyomino does not appear outside
of the polyomino. More formally, for each polyomino $\alpha$, $|T(\alpha)| = |\alpha|$, where $T(\alpha)$ is the set
of tile types in $\alpha$, and for each pair of polyominos $\alpha$ and $\beta$, $\alpha \neq \beta \implies T(\alpha) \cap T(\beta) = \emptyset$.
This implies in particular that any tile type in a nontrivial polyomino appears in the terminal
assembly equally often as any other tile type in the same polyomino.

Say that a tile system (possibly a subset of a larger tile system) that satisfies these properties 
is polyomino-robust. Most useful seeded tile systems tend to have these properties or are easily 
modified to have them. The two main tile subsystems that we analyze, shown in Figures 6 and 7, 
can be verified by inspection to obey these constraints. 

The property of polyomino-robustness allows us to reason about the system, in certain senses,
as if it were a seeded system. Properties (2) and (4), in particular, allow us to set concentrations
in such a way that we may assume that the concentration of individual tiles or polyominos that
can extend an intermediate assembly are always at least a certain value bounded away from 0 ($\delta$
in Lemma 6.2, and $\delta_1$ in Lemma 6.5). The trick is that tiles "further from the seed" (under the
ordering $\prec$) always have concentration at least $\delta$ greater than tiles "closer to the seed", so that
there will always be at least a $\delta$ excess of them in solution, no matter what combinations of partial
polyominos form before attaching to the seed. Property (2) implies that we may use a bounded
interval of concentrations (from $\delta$ to $3\delta$ in Figure 4) to achieve this. Given a maximal polyomino $\alpha$
and a tile type $r$ in $\alpha$, define $\mathrm{dist}_\alpha(r)$ to be the distance of $r$ from the minimal position (under $\prec$)
in the polyomino. (Since the polyomino is a linear chain, this number is well-defined.) In Figure 4,
for example, $\mathrm{dist}_{d_i}(p_{r_i}) = 0$ and $\mathrm{dist}_{d_i}(p_{s_{i+1}}) = 1$, where $p_{r_i}$ and $p_{s_{i+1}}$ are respectively the positions of
$r_i$ and $s_{i+1}$ in the polyomino $d_i$ (as defined in the proof of Lemma 6.2).

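Given an assembly $\beta$ and a tile type $r \in T$, define $\#_\beta(r) = |\{ p \in \mathrm{dom}\,\beta \mid \beta(p) = r \}|$ as the
number of times $r$ appears in $\beta$. If $\beta, \zeta$ are assemblies such that $\beta \sqsubseteq \zeta$, define $\zeta \setminus \beta$ to be the
unique assembly $\gamma$ such that $\mathrm{dom}\,\gamma = \mathrm{dom}\,\zeta \setminus \mathrm{dom}\,\beta$ and $\gamma \sqsubseteq \zeta$. If $\beta$ is an assembly and $T$ is a
tile set, define $T(\beta) = \{ r \in T \mid (\exists p \in \mathrm{dom}\,\beta)\ \beta(p) = r \}$ to be the set of tile types in $\beta$. For any
polyomino-robust TAS $\mathcal{T} = (T, \tau)$, let $\mathcal{T}_s = (T, \sigma, \tau)$ denote the seeded version of the hierarchical
system $\mathcal{T}$, where $\sigma$ is the single-tile initial assembly consisting of only the tile $s_1$. For any producible
(in the seeded model) assembly $\alpha \in \mathcal{A}[\mathcal{T}_s]$, let $\mathcal{A}[\alpha] = \{ \beta \in \mathcal{A}[\mathcal{T}_s] \mid \alpha \sqsubseteq \beta \}$ denote the set of
producible (in the seeded model) superassemblies of $\alpha$, and let $[\mathcal{A}[\alpha]](t) = \sum_{\beta \in \mathcal{A}[\alpha]} ([\beta](t) \cdot \#_\beta(\alpha))$,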

22 Each polyomino is a "chain" with a well-defined tile "closest" to $s_1$. Therefore, while $p_1$ and $p_2$ may attach at
the same time, it is always possible for the tile at $p_1$ to attach strictly sooner. In particular hierarchical growth is
not required for assembly to proceed.

where $\#_\beta(\alpha)$ denotes the number of times that $\alpha$ appears as a subassembly of $\beta$. That is, $[\mathcal{A}[\alpha]](t)$
is the total concentration of $\alpha$ or of assemblies containing $\alpha$, where each duplicate appearance of $\alpha$
in a single superassembly contributes to the concentration separately. 23
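The set-theoretic definitions above translate directly into code. The following Python sketch assumes, purely for illustration, that an assembly is represented as a dictionary from integer grid positions to tile-type names; the function names are hypothetical and the concentration-weighted quantity $[\mathcal{A}[\alpha]](t)$ is not modeled.

```python
def is_subassembly(beta, zeta):
    """True if beta agrees with zeta on every position of beta."""
    return all(pos in zeta and zeta[pos] == tile for pos, tile in beta.items())


def count(beta, r):
    """Number of positions of beta that hold tile type r."""
    return sum(1 for tile in beta.values() if tile == r)


def tile_types(beta):
    """Set of tile types appearing in beta."""
    return set(beta.values())


def difference(zeta, beta):
    """The part of zeta lying outside the domain of beta."""
    if not is_subassembly(beta, zeta):
        raise ValueError("beta must be a subassembly of zeta")
    return {pos: tile for pos, tile in zeta.items() if pos not in beta}
```

As a usage example, with `beta = {(0, 0): "s1"}` and `zeta = {(0, 0): "s1", (0, 1): "r1", (1, 1): "s2"}`, the call `difference(zeta, beta)` returns the two-tile extension, whose tile types are disjoint from those of `beta`.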

The next lemma shows conditions under which a partial assembly $\beta$ grows into a superassembly
$\zeta \sqsupseteq \beta$. Informally, the lemma says that if we have "substantial" (at least $\delta_0$) concentration of $\beta$
(and its superassemblies), but the total concentration of seeds is "small" (at most $\delta_{s_1}$) compared
to tile types that assemble an extension $\gamma$ of $\beta$ (to create a superassembly of $\beta$ called $\zeta$), and if
the concentrations of those tile types are set to ensure that all individual tile types have "excess"
concentration (at least $\delta_1$), then a "substantial" concentration of $\zeta$ will assemble in time linear in
$\frac{|\gamma|}{\delta_1}$.

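Lemma 6.5. Let $\mathcal{T} = (T, \tau)$ be a polyomino-robust hierarchical TAS with unique terminal assembly
$\widehat{\alpha}$. Let $\beta, \zeta \in \mathcal{A}[\mathcal{T}_s]$ such that $\beta \sqsubseteq \zeta$. Let $\gamma = \zeta \setminus \beta$, and suppose that $T(\gamma) \cap T(\widehat{\alpha} \setminus \gamma) = \emptyset$ (i.e., tile
types within $\gamma$ are unique to $\gamma$), and that all polyominos contained in $\zeta$ are completely contained
in $\beta$ or completely contained in $\gamma$. Suppose also that for all $\alpha \in \mathcal{A}[\beta]$, $\#_\alpha(\beta) = 1$, and for all
$\alpha \in \mathcal{A}[\zeta]$, $\#_\alpha(\zeta) = 1$. Suppose that there exist $t_0, c, \delta_0, \delta_{s_1} \in \mathbb{R}^+$ such that the following hold.

• $[s_1](0) \leq \delta_{s_1}$.

• $[\mathcal{A}[\beta]](t_0) \geq \delta_0$.

• $\delta_{s_1} \leq \delta_0 c$.

Set initial concentrations of all $r \in T(\gamma)$ as follows. Let $\delta_1 > 0$. 24 Set $[r](0) = \delta_{s_1} \cdot \#_\gamma(r) +
(\mathrm{dist}_p(r) + 1) \cdot \delta_1$.

Then for all $t \geq \frac{2c|\gamma|}{\delta_1} + t_0$, $[\mathcal{A}[\zeta]](t) \geq \frac{\delta_0}{2}$.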
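The concentration rule in the lemma is a one-line formula, and it is instructive to compare it with the concentrations of Lemma 6.2. The following Python sketch uses a hypothetical helper name and parameter names. The comparison assumes that in the dimer $d_i$ the tile $r_i$ sits at distance 0 and its partner at distance 1, and that $\delta_{s_1} = \delta_1 = \delta$; both are assumptions made for the illustration rather than statements from the lemma.

```python
from fractions import Fraction


def initial_concentration(delta_s1, delta_1, copies_in_gamma, dist):
    """Concentration delta_s1 * (#copies in gamma) + (dist + 1) * delta_1."""
    return delta_s1 * copies_in_gamma + (dist + 1) * delta_1
```

Under these assumptions a tile at distance 0 that appears once receives $2\delta$ and a tile at distance 1 receives $3\delta$, matching the values $2\delta$ and $3\delta$ chosen in Lemma 6.2.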

Proof. By Property (3) of polyomino-robustness, $\mathcal{A}[\mathcal{T}_s] = \{ \alpha \in \mathcal{A}[\mathcal{T}] \mid \alpha \text{ contains } s_1 \}$. Let $\alpha_1 \sqsubseteq
\alpha_2 \sqsubseteq \cdots \sqsubseteq \alpha_{k'} \in \mathcal{A}[\mathcal{T}]$ be a sequence of polyominos contained in $\beta$, with $|\alpha_i| = |\alpha_{i-1}| + 1$, such
that each $\alpha_i$'s minimal position (under $\prec$) is identical. In other words, if $\alpha$ is a maximal polyomino
consisting of tile types $r_1, r_2, \ldots, r_k$ at positions $p_1, \ldots, p_k$, with $p_1 \prec p_2 \prec \cdots \prec p_k$, then we
consider sequences of the form $\alpha_1, \ldots, \alpha_{k'}$ such that $\alpha_i$ contains exactly the tiles $r_j, \ldots, r_{j+i-1}$ for
some $j$. All of these are attachable to some producible assembly containing $s_1$ via the tile type $r_j$.
We allow $k = 1$ so that $\alpha_1$ can also represent any individual tile type $r \in T(\gamma)$ that is not part of
a polyomino. Call such a sequence a polyomino attachment class. By our choice of concentrations,
for all $t \in \mathbb{R}^+$ and all polyomino attachment classes $(\alpha_1, \ldots, \alpha_{k'})$,

$$\sum_{i=1}^{k'} [\alpha_i](t) \geq \delta_1. \qquad (6.6)$$

This follows from the fact that each such assembly $\alpha_i$ is consumed in a reaction in one of only
two ways:

23 For this definition to make sense, we must weight the sum by the number of times $\alpha$ appears in $\beta$ because
each time an assembly containing $\alpha$ binds to another assembly containing $\alpha$, the number of assemblies containing $\alpha$
decreases by one, even though the total concentration of "completed $\alpha$'s" has stayed the same. However, whenever
we actually apply this definition, it will be the case that $\#_\beta(\alpha) = 1$ for any producible $\beta$ such that $\alpha \sqsubseteq \beta$.

24 Think of $\delta_1 = \frac{1}{|T(\gamma)|}$ for common usage of the lemma. Intuitively, $\delta_1$ excess of each tile type is ensured, meaning
we can think of $\delta_1$ as the concentration of tile types in the seeded model.

1. in a reaction that binds $r_j$ to $r_{j-1}$ (which have concentration at most $\delta_{s_1} \cdot \#_\gamma(r_{j-1}) +
(\mathrm{dist}_p(r_{j-1}) + 1) \cdot \delta_1 = \delta_{s_1} \cdot \#_\gamma(r_{j-1}) + (j - 1) \cdot \delta_1$, which equals $\delta_{s_1} \cdot \#_\gamma(r_j) + (j - 1) \cdot \delta_1$ by
our assumption of equal counts of tiles that are part of the same polyomino, which is less
than $[r_j](0)$ by $\delta_1$), or

2. in a reaction producing another member of the same polyomino attachment class (thus not
altering the sum of (6.6), just shifting its terms). This corresponds to the attachment of tiles
"further from $\beta$ under $\prec$".

Therefore, there will always be at least an excess of $\delta_1$ concentration of polyominos starting with $r_j$.
In particular, for any producible assembly $\alpha$ containing $\beta$, with frontier location $p$ at which some
tile type $r_j$ can attach in the seeded model, (6.6) ensures that the total concentration of polyominos
that can attach to position $p$ in the hierarchical model is at least $\delta_1$.

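Recall that for all $\alpha \in \mathcal{A}[\beta]$, $\#_\alpha(\beta) = 1$ and for all $\alpha \in \mathcal{A}[\zeta]$, $\#_\alpha(\zeta) = 1$. Then for all $t \in \mathbb{R}^+$,
$[\mathcal{A}[\beta]](t) = \sum_{\alpha \in \mathcal{A}[\beta]} [\alpha](t)$ and $[\mathcal{A}[\zeta]](t) = \sum_{\alpha \in \mathcal{A}[\zeta]} [\alpha](t)$. Since $[\mathcal{A}[\beta]](t)$ is monotonically increasing,
for all $t \geq t_0$,

$$\begin{aligned}
[\mathcal{A}[\beta]](t_0) &= \sum_{\alpha \in \mathcal{A}[\beta]} [\alpha](t_0) \\
&\leq \sum_{\alpha \in \mathcal{A}[\beta]} [\alpha](t) \\
&= \sum_{\alpha \in \mathcal{A}[\zeta]} [\alpha](t) + \sum_{\alpha \in \mathcal{A}[\beta] \setminus \mathcal{A}[\zeta]} [\alpha](t) \\
&= [\mathcal{A}[\zeta]](t) + \sum_{\alpha \in \mathcal{A}[\beta] \setminus \mathcal{A}[\zeta]} [\alpha](t). \qquad (6.7)
\end{aligned}$$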

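For all $\alpha \in \mathcal{A}[\mathcal{T}_s]$, define $|\alpha|_\gamma = |\mathrm{dom}\,\alpha \cap \mathrm{dom}\,\gamma|$ to be the number of tiles in $\alpha$ that are part of
$\gamma$. For all $t \in \mathbb{R}^+$, define

$$S_\gamma(t) = \sum_{\alpha \in \mathcal{A}[\beta]} |\alpha|_\gamma \cdot [\alpha](t).$$

Think of $S_\gamma(t)$ as the total "mass within $\gamma$" (concentration $\cdot$ (size within $\mathrm{dom}\,\gamma$)) of assemblies
that contain $\beta$, noting that $|\alpha|_\gamma \leq |\gamma|$ for all $\alpha \in \mathcal{A}[\mathcal{T}_s]$. Since $\beta$ contains $s_1$, this implies that
$[\mathcal{A}[\beta]](t) \leq \delta_{s_1}$ for all $t \in \mathbb{R}^+$ by the bound on $[s_1](0)$. $S_\gamma(t)$ is maximized when all seed tiles have
been incorporated into $\zeta$ or one of its superassemblies, which implies that

$$\max_{t} S_\gamma(t) = \max_{t} \sum_{\alpha \in \mathcal{A}[\beta]} |\alpha|_\gamma \cdot [\alpha](t) \leq |\gamma| \max_{t} \sum_{\alpha \in \mathcal{A}[\beta]} [\alpha](t) = |\gamma| \max_{t} [\mathcal{A}[\beta]](t) \leq \delta_{s_1} |\gamma|. \qquad (6.8)$$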

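Since every incomplete (with respect to completing $\zeta$) assembly $\alpha \in \mathcal{A}[\beta] \setminus \mathcal{A}[\zeta]$ that contains $\beta$
can react with at least one polyomino attachment class as defined in (6.6), increasing the number
of tiles occupying $\mathrm{dom}\,\gamma$ by at least 1, it follows that for all $t \geq t_0$,

$$\begin{aligned}
\frac{dS_\gamma(t)}{dt} &\geq \delta_1 \sum_{\alpha \in \mathcal{A}[\beta] \setminus \mathcal{A}[\zeta]} [\alpha](t) && \text{by (6.6)} \\
&\geq \delta_1 \bigl([\mathcal{A}[\beta]](t_0) - [\mathcal{A}[\zeta]](t)\bigr) && \text{by (6.7)} \\
&\geq \delta_1 \delta_0 - \delta_1 [\mathcal{A}[\zeta]](t). && (6.9)
\end{aligned}$$

By the monotonicity of $[\mathcal{A}[\zeta]](t)$, it suffices to show that for $t^* = \frac{2c|\gamma|}{\delta_1} + t_0$, $[\mathcal{A}[\zeta]](t^*) \geq \frac{\delta_0}{2}$.
Suppose for the sake of contradiction that $[\mathcal{A}[\zeta]](t^*) < \frac{\delta_0}{2}$. Then by (6.9), for all $t \in [t_0, t^*]$,
$\frac{dS_\gamma(t)}{dt} \geq \delta_1 \delta_0 - \delta_1 [\mathcal{A}[\zeta]](t) > \frac{\delta_1 \delta_0}{2}$. This implies that $S_\gamma(t^*) > \frac{\delta_1 \delta_0}{2}(t^* - t_0) = \delta_0 c |\gamma| \geq
\delta_{s_1} |\gamma|$, which contradicts (6.8). □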

The following lemma extends the analysis of Lemma 6.5 to the case where we want to analyze
multiple "stages" of assembly ($p$ is the number of stages in the statement of Lemma 6.6), where
each stage may involve the addition of tile types of asymptotically different concentrations than the
other stages. Intuitively, this is required to prove fast assembly time because some stages (such as
hard-coding the seed row of a counter with $\omega(1)$ tile types) proceed slowly relative to their size since
they require many tile types (hence each tile type has low concentration). This is not a problem
since the size of such stages is small, but it implies that we cannot apply the "average attachment
time per tile" of such a slow stage uniformly across the entire assembly. Some stages (such as
completing a counter with a complete seed row with $O(1)$ tile types) proceed very quickly, and
such stages account for "most" of the size of the final assembly, so that the total assembly speed is
fast despite a few "bottleneck stages" in which the assembly process slows down for a short time.

Lemma 6.6. Let $\mathcal{T} = (T, \tau)$ be a polyomino-robust hierarchical TAS with unique terminal assembly $\hat{\alpha}$. Let $\beta_1, \beta_2, \ldots, \beta_p \in \mathcal{A}[\mathcal{T}_s]$ such that $\beta_{i-1} \sqsubseteq \beta_i$ for all $1 < i \leq p$, where $\beta_1$ is the assembly with a single seed $s_1$. Let $\gamma_i = \beta_{i+1} \setminus \beta_i$ for each $i < p$, and suppose that $T(\gamma_i) \cap T(\gamma_j) = \emptyset$ for all distinct $i, j < p$, and that all polyominos contained in $\beta_p$ are completely contained in $\gamma_i$ for some $i$ (i.e., suppose that each $\beta_i$ and $\gamma_i$ satisfy the hypothesis of Lemma 6.5, where $\beta_{i+1}$ in this lemma is interpreted to be $\zeta$ in Lemma 6.5). Suppose also that $\delta_{s_1} < \frac{1}{|\beta_p|}$. Set initial concentrations of all $r \in T(\gamma_i)$, where $i < p$, as

\[
[r](0) = \delta_{s_1} \cdot \#_{\gamma_i}(r) + (\mathrm{dist}_p(r) + 1) \cdot \frac{1}{|T(\gamma_i)|}.
\]

Then for all $t \geq \sum_{i=1}^{p-1} 2^i |\gamma_i| |T(\gamma_i)|$, $[\mathcal{A}[\beta_p]](t) \geq \frac{\delta_{s_1}}{2^{p-1}}$. Furthermore, $\sum_{r \in T} [r](0) \leq 1 + p(q+1)$, where $q$ is the maximum size of any polyomino as in the definition of polyomino-robustness.

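Proof. We will show that the statement $[\mathcal{A}[\beta_m]](t) \geq \frac{\delta_{s_1}}{2^{m-1}}$ for all $t \geq \sum_{i=1}^{m-1} 2^i |\gamma_i| |T(\gamma_i)|$ is true for all $m \leq p$ by induction on $m$.

This statement is trivially true for $m = 1$ since $[\mathcal{A}[\beta_1]](t) = [\mathcal{A}[s_1]](t) = \delta_{s_1}$ for all $t$.

Assume that the statement is true for $m = m_0$. In other words, at time $t_0 = \sum_{i=1}^{m_0 - 1} 2^i |\gamma_i| |T(\gamma_i)|$, $[\mathcal{A}[\beta_{m_0}]](t_0) \geq \frac{\delta_{s_1}}{2^{m_0 - 1}}$. Invoking Lemma 6.5 with $\beta = \beta_{m_0}$, $\gamma = \gamma_{m_0}$, $t_0 = \sum_{i=1}^{m_0 - 1} 2^i |\gamma_i| |T(\gamma_i)|$ and $c = 2^{m_0 - 1}$, we know that $[\mathcal{A}[\beta_{m_0 + 1}]](t) \geq \frac{\delta_{s_1}}{2^{m_0}}$ at $t = t_0 + 2^{m_0} |\gamma_{m_0}| |T(\gamma_{m_0})| = \sum_{i=1}^{m_0} 2^i |\gamma_i| |T(\gamma_i)|$. Therefore, the statement is true for $m = m_0 + 1$.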

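The total concentration of tiles whose type is in $T(\beta_p)$ is

\[
\begin{aligned}
\sum_{r \in T(\beta_p)} [r](0) &= \sum_{i=1}^{p} \sum_{r \in T(\gamma_i)} [r](0) \\
&= \sum_{i=1}^{p} \left( \delta_{s_1} \sum_{r \in T(\gamma_i)} \#_{\gamma_i}(r) + \sum_{r \in T(\gamma_i)} (\mathrm{dist}_p(r) + 1) \cdot \frac{1}{|T(\gamma_i)|} \right) \\
&\leq \sum_{i=1}^{p} \delta_{s_1} |\gamma_i| + \sum_{i=1}^{p} \max_{r \in T(\gamma_i)} [\mathrm{dist}_p(r) + 1] \\
&\leq \delta_{s_1} |\beta_p| + \sum_{i=1}^{p} \max_{r \in T(\gamma_i)} [\mathrm{dist}_p(r) + 1].
\end{aligned}
\]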
Since $\delta_{s_1} < \frac{1}{|\beta_p|}$, and all polyominos have size at most $q$, the total initial concentration of all tile types is at most $1 + p(q + 1)$. □

In particular, if the number of stages $p$ and polyomino size bound $q$ are constant with respect to the size of the terminal assembly (call this parameter $n$, as in our diameter $O(n)$ rectangle of Theorem 6.1), then the total initial concentration of all tile types is constant with respect to $n$. In that case, we can scale these concentrations to obey the finite density constraint without affecting the asymptotic time and concentration bounds derived in Lemma 6.6.

The following is a discrete version of Lemma 6.5 that can be used to analyze polyomino-robust systems in the discrete assembly time model.

Lemma 6.7. Let $\mathcal{T} = (T, \tau)$ be as in Lemma 6.5, and let $\alpha \in \mathcal{A}[\beta]$ be the current state of the Markov process defining the assembly time of $\mathcal{T}$, and suppose that the current time is $t_0$ and that $\beta \sqsubseteq \alpha$. Define $T_{\alpha,\zeta}$ to be the random variable representing the first time at which $\alpha$ grows into a superassembly of $\zeta$. Define $T(\alpha, \zeta) = \mathrm{E}[T_{\alpha,\zeta}]$. Then $T(\alpha, \zeta) \leq \frac{|\gamma|}{\delta_1} + t_0$.

Proof. For all $j \in \{0, \ldots, |\gamma|\}$, let $t_j$ denote the expected time until the assembly adds at least one more tile to $\mathrm{dom}\,\gamma$, conditioned on the event that the current size of the assembly within $\mathrm{dom}\,\gamma$ is $j$. By (6.6) and the model of Markov process transition rates we employ to determine $T(\mathcal{T}, C, s_1)$, it holds that $t_j \leq 1/\delta_1$ for all $j \in \{0, \ldots, |\gamma| - 1\}$. By linearity of expectation, $T(\alpha, \zeta) = \mathrm{E}[T_{\alpha,\zeta}] \leq t_0 + \sum_{j=0}^{|\gamma| - 1} t_j \leq \frac{|\gamma|}{\delta_1} + t_0$. □

6.3 Construction of a Fast-Assembling Shape

This section describes the main components of the construction of Theorem 6.1.

Figure 5 shows an overview of the assembly described in Theorem 6.1. It consists of an initial (blue) "vertical bar", and $m$ copies of another type of (blue) vertical bar, each adjacent pair connected by a pair of (two different types of gray) "horizontal bars". The leftmost vertical bar forms, then the two horizontal bars attach, after which their right-side single-strength glues cooperate to attach a new vertical bar to the right. This continues until all $m$ "stages" are complete. In the meantime, filler tiles fill in the gaps to complete the rectangle. Figures 6 and 7 show some more detail of the tile types that create the horizontal and vertical bars, and Lemmas 6.8 and 6.9 respectively show that these subassemblies have "substantial" concentration "quickly enough" to be useful to prove the time bound of Theorem 6.1.

Figures 6 and 7 show details of the tile types that assemble the "horizontal bars" and "vertical bars" of Figure 5, and Lemmas 6.8 and 6.9 bound their assembly time. It may be beneficial for the reader first to skim the proof of Theorem 6.1, prior to examining Figures 6 and 7 in detail, in order to understand the intuitive purpose of the shape and outer glue placement of the horizontal and vertical bars.

For all $n \in \mathbb{Z}^+$, $\lfloor \log n \rfloor + 1$ is the number of bits required to represent $n$ in binary, so that $2^{\lfloor \log n \rfloor + 1}$ is the next power of 2 greater than $n$.

Lemma 6.8. Let $\mathcal{T} = (T, 2)$ be the "horizontal bar with an arm" hierarchical TAS shown (by example) in Figure 6, and let $\hat{\alpha}$ be its unique terminal assembly of an $O(w) \times O(\log w)$ horizontal bar with an $O(\log k) \times O(k)$ "vertical arm" on its left, where $w$ and $k$ are as in the proof of Theorem 6.1 (so that $w > k$ in particular). Then there is an assignment of concentrations obeying $\sum_{r \in T} [r](0) = O(1)$ (with respect to $k$ and $w$) such that for all $t \geq O(w \log w)$, $[\hat{\alpha}](t) \geq \Omega\left(\frac{1}{w \log w}\right)$.
[Figure 5; panel labels: stage 1, stage 3]

Figure 5: High-level overview of the interaction of "vertical bars" and "horizontal bars" to create the rectangle of Theorem 6.1. Filler tiles fill in the empty regions as shown in Figure 9. If glues overlap two regions then they represent a formed bond. If glues overlap one region but not another, they are glues from the former region but are mismatched (and thus "covered and protected") by the latter region.



Proof. Let $\beta_1 \sqsubseteq \beta_2 \sqsubseteq \beta_3 \sqsubseteq \beta_4 \sqsubseteq \beta_5$ respectively represent the assemblies encircled by dotted lines of Figure 6, so that $\beta_1$ is just the tile $s_1$, and $\beta_5 = \hat{\alpha}$. Set the initial concentrations of tile types in $T$ as in the statement of Lemma 6.6. Defining $\gamma_1, \ldots, \gamma_5$ as in Lemma 6.6, note that $|T(\gamma_1)| = O(\log w)$, $|T(\gamma_2)| = O(1)$, $|T(\gamma_3)| = O(\log k)$, $|T(\gamma_4)| = O(1)$, $|\gamma_1| = O(\log w)$, $|\gamma_2| = O(w \log w)$, $|\gamma_3| = O(\log k)$, and $|\gamma_4| = O(k \log k)$. Therefore Lemma 6.6 tells us that for all $t \geq 2 \cdot O(\log^2 w) + 4 \cdot O(w \log w) + 8 \cdot O(\log^2 k) + 16 \cdot O(k \log k) = O(w \log w)$ (since $w > k$), $[\mathcal{A}[\hat{\alpha}]](t) \geq \delta_{s_1}/16$. Setting $\delta_{s_1} = \frac{1}{2 w \log w}$ satisfies $\delta_{s_1} < \frac{1}{|\beta_5|}$, so that the total concentration of tile types is at most $1 + p(q + 1)$, where $p = 5$ and $q = 4$ in the tile system of Figure 6, where $q$ is defined to be the maximum size of any polyomino as in the definition of polyomino-robustness, giving the required constant concentration bound. □
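The way the stage sizes above combine in Lemma 6.6 can be checked numerically. The following is a minimal sketch, assuming every hidden constant in the $O(\cdot)$ bounds equals 1 and using base-2 logarithms; the function names and the sample values of $k$ and $w$ are hypothetical and chosen only for illustration.

```python
from math import log2

def lemma_6_6_time_bound(stages):
    """Sum of 2^i * |gamma_i| * |T(gamma_i)| over stages i = 1, ..., p-1.

    `stages` is a list of (size, num_tile_types) pairs, one per gamma_i.
    """
    return sum(2 ** i * size * types
               for i, (size, types) in enumerate(stages, start=1))

def lemma_6_6_concentration_bound(delta_s1, p):
    """Lower bound delta_s1 / 2^(p-1) on the concentration of beta_p."""
    return delta_s1 / 2 ** (p - 1)

def horizontal_bar_stages(w, k):
    """Stage sizes from the proof of Lemma 6.8, with all constants set to 1
    (an assumption made only for this illustration)."""
    lw, lk = log2(w), log2(k)
    return [
        (lw, lw),       # gamma_1: O(log w) tiles, O(log w) tile types
        (w * lw, 1),    # gamma_2: O(w log w) tiles, O(1) tile types
        (lk, lk),       # gamma_3: O(log k) tiles, O(log k) tile types
        (k * lk, 1),    # gamma_4: O(k log k) tiles, O(1) tile types
    ]

k = 16
w = k ** 4
bound = lemma_6_6_time_bound(horizontal_bar_stages(w, k))
dominant = 4 * w * log2(w)   # contribution of gamma_2 alone
print(bound, dominant, lemma_6_6_concentration_bound(1.0, p=5))
```

Under these assumptions the $\gamma_2$ term accounts for nearly all of the bound, which is the sense in which the sum simplifies to $O(w \log w)$.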

[Figure 6; labels: increment; copy; count $k$ (type A) or $k^2$ (type B) rows (LSB on left); count $w$ columns (LSB on bottom)]

Figure 6: "Horizontal bar with a vertical arm" for the construction of a fast-assembling square. The arm (assembled by a downward-growing counter similar to the horizontal counter, and set up to grow only after the horizontal counter is complete) is intended to block other horizontal bars from binding after this one has bound to a vertical bar. The arm has height either $k$ (for type "A" horizontal bars of Figure 8) or $k^2$ (for type "B" horizontal bars). Since we choose $w = k^4$, in either case the tile complexity and assembly time are dominated by the horizontal bar. The circles on the left and right indicate single-strength glues that are used when and after the bar binds hierarchically. The two west-facing single-strength glues cannot cooperate until the entire assembly is complete, so in particular the right glue must already be present. The east-facing single-strength glue is designed to cooperate with a glue from a different horizontal bar to control placement of a vertical bar, as in Figure 8. The north- and south-facing single-strength glues are designed to help with filler tiles or stage-counting as in Figure 9.

Lemma 6.9. Let $\mathcal{T} = (T, 2)$ be the vertical bar hierarchical TAS shown (by example) in Figure 7, and let $\hat{\alpha}$ be its unique terminal assembly of an $O(mk^2) \times O(\log k)$ rectangle, where $m$ and $k$ are as in the proof of Theorem 6.1. $\hat{\alpha}$ places $mk$ single-strength "type B" (as in Figure 8) glues on its left and right (all glues on the left identical, and all glues on the right identical to each other but different from the left glues), spaced every $k$ vertical rows, and another $k$ "type A" glues (as in Figure 8) spaced 2 rows apart on the top left and right. Also, there is an assignment of concentrations obeying $\sum_{r \in T} [r](0) = O(1)$ (with respect to $k$ and $w$), such that for all $t \geq \Omega(mk^2 \log(mk))$, $[\hat{\alpha}](t) \geq \Omega\left(\frac{1}{mk^2 \log(mk)}\right)$.

[Figure 7; labels: when counter 1 reaches all 1's, reset counter 1, increment counter 2, place glues on each side; when counters 1 and 2 reach all 1's, reset counters 1 and 2 and increment counter 3 (counter 3 bits not shown); when all counters reach all 1's, count to $k$ once more to place group A glues, spaced two apart]

Figure 7: Tiles to assemble a "vertical bar" as in Figures 5, 8, and 9 for the fast self-assembly of a rectangle. There are two types of vertical bars, the leftmost one with no glues on its west side (not shown), and the other vertical bars with glues on each side (shown).

Proof. The tiles are essentially zig-zag counters as described in [38]. Since single-strength glues must be placed in precise locations as required in Figures 5, 8, and 9, some modifications of the counter are necessary. There are three embedded counters 1, 2, and 3, counting to $k$, $k$, and $m$, respectively ($m = k$ in our construction). Counter 1 bits are shown on the left side of each tile, counter 2 bits on the right, and counter 3 bits are omitted. Counter 1 increments each row, and when it rolls over, it resets and counter 2 increments once. Similarly, when counter 2 rolls over, it and counter 1 reset and counter 3 increments. The values $m$ and $k$ are embedded in the first two rows and carried through each subsequent row to enable the resets. These place single-strength glues on each side (the left glues to allow the vertical bar to bind to two horizontal bars as shown in Figure 8, and the right glues to help the orange counter in Figure 9 to correctly place glues on the right side of the vertical bar once it has attached). These single-strength glues are the "group B" glues of Figure 8. Finally, when counter 3 rolls over, a new counter (using new tile types) is initiated to count to $2k$, placing a glue every other row; these are the "group A" glues of Figure 8. To ensure that the vertical arm of the bottommost horizontal bar does not protrude below the bottom of the vertical bar, it is necessary to first count $k^2$ rows without placing side glues, but for space reasons this is not shown. Also, the first vertical bar must count an additional $\log m$ rows, since each subsequent vertical bar will add this many rows to the bottom, as shown in Figure 9, when the stage computation counter must "crawl" below the bottom of the vertical bar in order to place the east-facing glues needed for the next-stage horizontal bars to bind to the vertical bar.
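The row-by-row behavior of the three embedded counters can be imitated directly. The sketch below is not the tile system itself: it abstracts each tile row as one loop iteration, assumes a group B glue is placed exactly when counter 1 rolls over, and ignores the extra $k^2$ glue-free rows, the group A counter, and the additional $\log m$ rows. The function name and sample parameters are hypothetical.

```python
def vertical_bar_rows(k, m):
    """Simulate the three nested counters one row at a time.

    Returns the total number of rows and the (1-based) rows at which
    counter 1 rolls over, i.e. where a group B glue would be placed.
    """
    c1 = c2 = c3 = 0
    row = 0
    glue_rows = []
    while c3 < m:
        row += 1
        c1 += 1
        if c1 == k:              # counter 1 rolls over: reset, place glue
            c1 = 0
            glue_rows.append(row)
            c2 += 1
            if c2 == k:          # counter 2 rolls over: reset, bump counter 3
                c2 = 0
                c3 += 1
    return row, glue_rows

rows, glues = vertical_bar_rows(k=4, m=3)
gaps = {b - a for a, b in zip(glues, glues[1:])}
print(rows, len(glues), gaps)
```

In this abstraction the bar has $mk^2$ rows and carries $mk$ glues spaced $k$ rows apart, in agreement with the "mk identical glues" label of Figure 8.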

Let $\beta_1 \sqsubseteq \beta_2 \sqsubseteq \beta_3$ respectively represent the assemblies encircled by dotted lines of Figure 7, so that $\beta_1$ is just the tile $s_1$, and $\beta_3 = \hat{\alpha}$. Set the initial concentrations of tile types in $T$ as in the statement of Lemma 6.6. Defining $\gamma_1, \ldots, \gamma_3$ as in Lemma 6.6, note that $|T(\gamma_1)| = O(\log k + \log m)$, $|T(\gamma_2)| = O(1)$, $|\gamma_1| = O(\log k + \log m)$, and $|\gamma_2| = O(mk^2(\log k + \log m))$. Therefore Lemma 6.6 tells us that for all $t \geq 2 \cdot O((\log k + \log m)^2) + 4 \cdot O(mk^2(\log k + \log m)) = O(mk^2 \log(mk))$, $[\mathcal{A}[\hat{\alpha}]](t) \geq \delta_{s_1}/4$. Setting $\delta_{s_1} = \frac{1}{2mk^2(\log m + 2 \log k)}$ satisfies $\delta_{s_1} < \frac{1}{|\beta_3|}$ and gives a total concentration of tile types at most $1 + p(q + 1)$, where $p = 3$ and $q = 3$ in the tile system of Figure 7, where $q$ is defined to be the maximum size of any polyomino as in the definition of polyomino-robustness, giving the required constant concentration bound. □

We now prove the main theorem of this section, Theorem 6.1. 

Proof of Theorem 6.1. Based on the primitives introduced in Figures 4, 6, and 7 and their assembly 
time analysis, Figures 5, 8, and 9 outline the remainder of the construction. 

Intuitively, the construction proceeds as follows. Fix positive integers $k$, $m$, and $w$ to be defined later. The rectangle grows rightward in $m$ "stages", each stage of width $w$ and height $h = O(mk^2)$. The speedup is obtained by using "binding parallelism": the ability of a single (large) assembly $\beta$ to bind to multiple sites on another assembly $\alpha$. Think of $\alpha$ as the structure built so far, with a vertical bar on its right end, and think of $\beta$ as one of the horizontal bars shown in Figures 5 and 8. This "binding parallelism" is in addition to the "assembly parallelism" described in Section 3: the ability for $\alpha$ to assemble in parallel with $\beta$ so that (a large concentration of) $\beta$ is ready to bind as soon as $\alpha$ is assembled. The number $k$ controls the amount of "binding parallelism": it is the number of binding sites on $\alpha$ to which $\beta$ may bind, the first of which binds in expected time $\frac{1}{k}$ times the expected time before any fixed binding site binds (since the minimum of $k$ independent exponential random variables of expected value $t$ has expected value $\frac{t}{k}$). Actually two different versions of $\beta$ bind to one of two different regions on $\alpha$, each with $k$ binding sites. Because assembly may proceed as soon as each of the two regions has a $\beta$ bound (so that no individual binding site is required before assembly can proceed), the system is not a partial order system; in fact it is not even directed, since different filler tiles will fill in the other $k - 1$ regions where copies of $\beta$ could have gone but did not.
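The $\frac{t}{k}$ claim about the first of $k$ binding sites can be illustrated with a small Monte Carlo experiment. This is only a sketch of the probabilistic fact, assuming independent exponential waiting times at each site; the function name, trial count, and seed are arbitrary choices made here.

```python
import random

def mean_first_binding_time(k, t, trials=20000, seed=0):
    """Estimate E[min of k independent exponentials, each with mean t]."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        # expovariate takes the rate (1 / mean) as its argument.
        total += min(rng.expovariate(1.0 / t) for _ in range(k))
    return total / trials

estimate = mean_first_binding_time(k=8, t=4.0)
print(estimate, 4.0 / 8)
```

With $k = 8$ sites that each take expected time $t = 4$, the estimate should land close to $t/k = 0.5$.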

The timekeeper seed is contained in a height-$h$ "vertical bar" with a region of $k$ pairs of single-strength glues on its top right, and another region of $k$ (more widely spaced) pairs of single-strength glues on its right below the first region. Within each pair of glues, the two glues are different (despite being the same color in Figures 5 and 8). However, within the first region, each pair is the same. Within the second region, each pair of glues is also identical to all other pairs in the second region, although each is distinct from the pair of glues of the first region. This vertical bar assembles as in Figure 7, but rotated 90 degrees. The placement of the glues on the right is simple to calculate from the numbers $k$ and $m$, which are embedded into the tile types. The width of the vertical bar is therefore $O(\log k + \log m)$ (although most figures show the vertical bar as "thin", it is more than one tile wide), and the bar requires at most $O(\log k + \log m)$ tile types to encode. The leftmost vertical bar, having no west-facing glues, has distinct tile types from the other vertical bars, but all other vertical bars are identical to each other.

The horizontal bar of Figure 6 also has two types, but unlike the vertical bar, both types appear in each stage: type A to bind in the top region and type B to bind in the bottom region of the vertical bar, each with different single-strength glues on each end from the other type. The vertical "arm" on the left side below the bar is intended to prevent other horizontal bars from binding to another pair of glues in the same region of a vertical bar. An order of growth is chosen that enforces that the entire horizontal bar and the arm must be assembled before the two strength-1 glues on the left are present. This is required for speed; if a horizontal bar could bind to the right of a vertical bar before the horizontal bar is complete, then it would have to complete (taking at least time $w$ in the worst case) before growth of the larger assembly could continue. By ensuring that only complete horizontal bars can attach to vertical bars (and ensuring that many horizontal bars will be complete by the time "most" vertical bars require them), the assembly grows rightward quickly. Since the right side of a horizontal bar has only a single strength-1 glue, two horizontal bars, one in each of regions A and B, are required to bind before the next stage's vertical bar may attach by using cooperative binding with each of these strength-1 glues.

[Figure 8; labels: height $h = O(mk^2)$; width $= w$; horizontal bar type A; horizontal bar type B; $k$ identical pairs of glues spaced $O(1)$ apart ("group A glues"); $k$ identical pairs of glues (different glues from top) spaced $O(k)$ apart ("group B glues"); $mk$ identical glues spaced $O(k)$ apart; previous stage lower horizontal bar attached to one of these $k$ glues; vertical bar as it appears before binding; partial vertical bar (will complete after binding to two horizontal bars); vertical bar after "post-binding processing" to place stage-specific right-side glues]

Figure 8: "Vertical bars" for the construction of a fast-assembling square, and their interaction with horizontal bars of Figure 6. "Type B" horizontal bars have a longer vertical arm than "Type A" since the glues they must block are farther apart.

Figure 9: Detailed view of how a single partial vertical bar completes into a full vertical bar after attaching to the previous stage's two horizontal bars. This enables the correct vertical placement of the two horizontal bars in the next stage. This is needed to communicate which stage is the current one based on the position of the previous lower horizontal bar, since there is only one type of vertical bar and it doesn't "know" the current stage.

The following is a key idea in the construction: since there are only two kinds of horizontal bars, the glues on the right side cannot "communicate" the current stage. The natural solution to this, letting single-tile addition propagate the stage information from left to right along the horizontal bar, defeats the purpose of letting the horizontal bar attach hierarchically: such a solution would slow down the assembly process to be no faster than the seeded model. To enforce that stage $i + 1$ properly follows stage $i$ (so that we deterministically stop after $m$ stages), the relative position of the horizontal bars with respect to each other communicates the current stage. This is the reason that each stage "staggers" the vertical position of the group B region of $k$ east-facing glues on the vertical bar, depending on the stage.

In more detail, examine the single-strength glues on the left side of the vertical bar in Figure 9. 
The group A glues on the top left side of the vertical bar are each 2 spaces apart (to make room 
for their counterpart glues that will be placed on the right side, which are twice as numerous since 
they cooperate in adjacent pairs). The group B glues are each k spaces apart. This ensures that 
every vertical distance between a glue from group A and a glue from group B is unique. Note that 
all vertical bars except the initial vertical bar have more single-strength glues on the left than on 
the right. These allow the vertical bars to bind at any stage. However, by the distance-uniqueness 
property just explained, there is only one vertical position in which the vertical bar can bind, so 
that the vertical bar will be evenly lined up with the vertical bar from all the previous stages. 
Furthermore, it is possible, as shown in Figure 9, to use a counter to measure the relative height 
of the bottom horizontal bar in order to determine the current stage. The counter increments 
once for each group of k single-strength glues that it passes (this can be implemented by marking 
boundaries between groups of k glues with a special glue). 

As Figure 9 shows, once this counter reaches the bottom of the vertical bar and has value $h$ to indicate that the previous stage was stage $m - h$, it uses $h$ to determine where to place pairs of glues on the right side of the vertical bar for the next stage: they should be placed at relative height $h - 1$. This processing of the left side of the vertical bar to assemble the right side of the vertical bar can be done with a constant number of tile types. When the value $h$ is calculated to be 0, no right side of the vertical bar is constructed, since we have reached the final stage. As Figure 9 shows, filler tiles fill in the gaps above and below the horizontal bars after the vertical bar binds to the right.

We now analyze the assembly time of this construction. Although the TAS we describe is not directed, the subcomponents are deterministic, so we may apply Lemmas 6.5, 6.6 and 6.7 to them.

We choose the timekeeper seed to be the same $s_1$ as shown in Figure 7, for the alternate (not shown) version of the tiles that create the stage 1 vertical bar, with no glues on the left side. By our design in Figure 9, the only way a horizontal bar can attach to any other assembly is to attach to a complete vertical bar to its left that is part of an assembly containing $s_1$.

Let $T_{h\text{-}A}$, $T_{h\text{-}B}$, $T_v$, and $T_{v_0}$ be the tile types to create horizontal type A bars, horizontal type B bars, vertical bars (partial, those colored blue in Figures 8 and 9), and the leftmost vertical bar, respectively. By the construction, we have $|T_{h\text{-}A}| = O(\log w + \log k)$, $|T_{h\text{-}B}| = O(\log w + \log k)$, and $|T_v| = |T_{v_0}| = O(\log w + \log k + \log m)$.

For the general system, we add all four systems $T_{h\text{-}A}$, $T_{h\text{-}B}$, $T_v$, and $T_{v_0}$. We also add a constant number of counter tile types that wrap around the vertical bar as in Figure 9, with each tile type having concentration $\Theta(1)$, and a constant number of filler tile types also with total concentration $O(1)$. In particular, assign the tile types concentrations as in the statement of Lemma 6.5, so that each tile type in the counter tiles and filler tiles (or polyomino attachment class) is guaranteed to have $\Theta(1)$ excess concentration by assigning extra concentration to tile types in polyominos. (Although the details are not shown, it is trivial to implement the stage counter tiles with a polyomino-robust system, and to implement the filler tiles with no polyominos at all.) This allows us to apply Lemma 6.7 to these subsystems with $\delta_1 = \Omega(1)$.

Let s\ denote the seed of the whole tile system. Let 5 Sl = [si](0) = • For our choice of 
k and m, this will mean S Sl = O(p-). Let «h-A denote the type A "horizontal bar with an arm" 
assembly. Let $\alpha_{h\text{-}B}$ denote the type B "horizontal bar with an arm" assembly. Let $\alpha_{v_0}$ denote the leftmost vertical bar assembly. Let $\alpha_v$ denote the other vertical bar assembly. By Lemma 6.8, for all $t \geq O(w \log w)$, the total concentration of type A horizontal bars that have been produced is at least $\Omega\left(\frac{1}{w \log w}\right)$ (although some may be incorporated into assemblies containing the seed, and similarly for type B horizontal bars). Since at most $\delta_{s_1} m$ concentration of each horizontal bar can be attached to an assembly containing $s_1$, this implies that for all $t \geq O(w \log w)$, $[\alpha_{h\text{-}A}](t) \geq \Omega\left(\frac{1}{w \log w}\right) - \delta_{s_1} m$, which is $\Omega\left(\frac{1}{w \log w}\right)$ by our choice of $\delta_{s_1}$. By the same reasoning, for all $t \geq O(w \log w)$, $[\alpha_{h\text{-}B}](t) \geq \Omega\left(\frac{1}{w \log w}\right)$. Similarly, by Lemma 6.9 and similar reasoning regarding the scarcity of $s_1$-containing assemblies that could attach to $\alpha_v$, for all $t \geq \Omega(mk^2 \log(mk))$, $[\alpha_v](t) \geq \Omega\left(\frac{1}{mk^2 \log(mk)}\right)$.

Having derived these concentration bounds in the mass-action model, we now analyze the 
stochastic assembly time of s\ into a terminal assembly. 

By Lemma 6.7, the leftmost vertical bar's bottom two rows assemble in expected time $O(\log k + \log m)$, using the fact that each tile type in the bottom two rows can be assigned concentration $\Omega\left(\frac{1}{\log k + \log m}\right)$ to ensure that $\delta_1 = \Omega\left(\frac{1}{\log k + \log m}\right)$ in Lemma 6.7. Again applying Lemma 6.7, the expected time to assemble the complete leftmost vertical bar from its seed row is at most $O(mk^2 \log(mk))$, using the fact that only $O(1)$ tile types are needed to complete the remaining rows and can therefore be assigned concentration $\Omega(1)$, in particular ensuring excess $\delta_1$ at least $\Omega(1)$ in Lemma 6.7.
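The form of the Lemma 6.7 bound, a sum of at most $|\gamma|$ waiting times each of expectation at most $1/\delta_1$, can be illustrated numerically. The sketch below assumes the worst case in which every attachment waits an exponential time with rate exactly $\delta_1$; the function names, sample sizes, and seed are hypothetical and are not part of the construction.

```python
import random

def lemma_6_7_bound(gamma_size, delta_1, t0=0.0):
    """Upper bound |gamma| / delta_1 + t0 on the expected completion time."""
    return gamma_size / delta_1 + t0

def simulate_completion_time(gamma_size, delta_1, t0=0.0, trials=2000, seed=1):
    """Average time to add gamma_size tiles one after another, where each
    addition waits an exponential time with rate delta_1 (worst case)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        total += t0 + sum(rng.expovariate(delta_1) for _ in range(gamma_size))
    return total / trials

bound = lemma_6_7_bound(gamma_size=200, delta_1=0.5)
estimate = simulate_completion_time(gamma_size=200, delta_1=0.5)
print(bound, estimate)
```

In this worst case the simulated mean sits near the bound itself; any attachment rate above $\delta_1$ would only shorten the completion time.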

Once the leftmost vertical bar completes, and if the current time $t = \Omega(w \log w)$, then by our above-derived bound on the concentration of $\alpha_{h\text{-}A}$, the expected time before a type A horizontal bar attaches to some binding site in the group A glues on the right side is at most $O(w \log w)/k$. This holds similarly for $\alpha_{h\text{-}B}$, so the expected time before both attachments happen is also at most $O(w \log w)/k$. Once two horizontal bars have attached, and if the current time $t = \Omega(mk^2(\log k + \log m))$, by our above-derived bound on the concentration of $\alpha_v$, the expected time before the next vertical bar attaches is at most $O(mk^2(\log k + \log m))$. Once this occurs, by Lemma 6.7, the expected time for the constant-size tile set shown in Figure 9 to complete the placement of glues on the east side of the just-attached vertical bar is at most $O(mk^2(\log k + \log m))$. At this point the first stage is complete, requiring time at most $O(w \log w) + O(mk^2(\log k + \log m))$ (to wait for sufficient concentration of horizontal and vertical bars) $+\ O(mk^2(\log k + \log m))$ (to grow the leftmost vertical bar) $+\ O(w \log w)/k$ (to attach two horizontal bars) $+\ O(mk^2(\log k + \log m))$ (to attach the second vertical bar) by linearity of expectation. Simplified, this is $O(w \log w) + O(mk^2(\log k + \log m)) + O(w \log w)/k$.

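Repeating this analysis for each of the remaining stages, the total time for the complete "skeleton" of Figure 8 to complete is at most $O(w \log w) + O(m^2k^2(\log k + \log m)) + O(mw \log w)/k$, by linearity of expectation: the initial $O(w \log w)$ wait for sufficient concentration is incurred only once, while the attachment terms are incurred in each of the $m$ stages.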

Finally, the filler tiles must tile the $3m$ empty regions left in the skeleton. Although this assembly process likely begins before the full skeleton is complete, we analyze it as if no filler tiles attach until the full skeleton is complete, as an upper bound for the actual assembly time. Each of these regions is a rectangle (minus the vertical arms of the horizontal bars) of diameter at most $O(w + mk^2)$, which is tiled by a constant-size rectilinear tile set (a tile set in which each tile cooperates using north and east glues to grow towards the southwest), where each tile type has concentration $\Theta(1)$ at all times. By Theorem 4.4 of [3], the expected time for one of these regions to be completely tiled is at most $O(w + mk^2)$, with an exponentially decaying tail on the time distribution. Since there are $3m$ such regions assembling in parallel, and each has an exponentially decaying tail, the expected time for all regions to completely fill is at most $O((w + mk^2) \log m)$.
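The extra $\log m$ factor for waiting on all $3m$ regions can be seen in the simplest model with an exponentially decaying tail. The sketch below uses the standard fact that the maximum of $r$ independent exponential variables with a common mean has expectation equal to that mean times the harmonic number $1 + \frac{1}{2} + \cdots + \frac{1}{r}$. Treating the regions' completion times as independent exponentials is an assumption made only for illustration; the actual times are governed by Theorem 4.4 of [3].

```python
from math import log

def expected_max_of_exponentials(r, mean=1.0):
    """Exact expectation of the maximum of r i.i.d. exponential variables
    with the given mean: mean * (1 + 1/2 + ... + 1/r)."""
    return mean * sum(1.0 / j for j in range(1, r + 1))

m = 100
value = expected_max_of_exponentials(3 * m)
print(value, log(3 * m))
```

For $m = 100$ the value lies between $\ln(3m)$ and $\ln(3m) + 1$, so waiting for every region costs a logarithmic factor over waiting for one.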
Therefore the entire expected assembly time is at most

\[
O(w \log w) + O(m^2k^2(\log k + \log m)) + O(mw \log w)/k + O((w + mk^2) \log m).
\]

We choose $k = m = n^{1/5}$ and $w = k^4$. Simplifying the above expression, this gives an expected assembly time of $O(n^{4/5} \log n)$. □
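As a check on the simplification, the substitution can be carried out term by term. The following worked calculation is a sketch that treats $n^{1/5}$ as an integer and uses only the parameter choices stated above; the last line, which reads the stage count and bar height as giving width about $mw$ and height $O(mk^2)$, is an inference from the stage description rather than a statement taken from the proof.

```latex
\begin{align*}
k = m = n^{1/5},\quad w = k^4 = n^{4/5}
  &\implies \log k,\ \log m,\ \log w = \Theta(\log n), \\
w \log w &= \Theta(n^{4/5} \log n), \\
m^2 k^2 (\log k + \log m) &= n^{4/5} \cdot \Theta(\log n), \\
\frac{m w \log w}{k} &= w \log w = \Theta(n^{4/5} \log n), \\
(w + m k^2) \log m &= (n^{4/5} + n^{3/5}) \cdot \Theta(\log n) = \Theta(n^{4/5} \log n), \\
m w &= n, \qquad m k^2 = n^{3/5}.
\end{align*}
```

Every term is therefore $O(n^{4/5} \log n)$, and no single term dominates the others by more than a constant factor under this choice of parameters.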

By using the base-conversion technique of [3] for all counters, the number of tile types required could be reduced from $O(\log n)$ to the information-theoretically optimal $O\left(\frac{\log n}{\log \log n}\right)$. However, this is a now-standard technique for obtaining optimal tile complexity of structures that "encode" a natural number $n$. Since the primary contribution of our construction is the bound on assembly time, we have presented a simpler (but larger than optimal) tile system for illustrative purposes. Unlike Theorem 3.1, where the problem of obtaining small assembly depth for a shape $S$ is trivialized by allowing tile complexity $|S|$, obtaining fast assembly time is nontrivial whether tile complexity is large or not. In fact, small tile complexity helps to obtain fast assembly time, since with fewer tile types, one can distribute to each tile type a greater share of the $O(1)$ concentration allowed by the finite density constraint, which in turn reduces the expected time for each tile to attach.

Also, it is possible to shave log factors from the assembly time analysis by using the "optimal counter" tiles of [3], which grow an $n \times \log n$ counter in the seeded model in time $O(n)$, compared to the suboptimal $O(n \log n)$ time required by the zig-zag counters we use. However, our Lemma 6.5 does not take the "binding parallelism" of the seeded model into account, but instead implicitly assumes in the worst case that the frontier is always size 1. A more careful analysis could remove some of these log factors, but we have allowed the log factors in order to simplify the analysis, since our main goal is to obtain a sublinear time bound.



7 Open Questions 

There are some interesting questions that remain open: 

1. What upper or lower bound can be placed on the quantity $\mathrm{depth}^{\mathrm{da}}(\mathcal{T})$ for $\mathcal{T}$ a hierarchical TAS that strictly self-assembles an $n \times n$ square with optimal tile complexity $O\left(\frac{\log n}{\log \log n}\right)$ (or even with nearly-optimal tile complexity $O(\log n)$)? It is not obvious how to show either $\mathrm{depth}^{\mathrm{da}}(\mathcal{T}) = o(\log^2 n)$ for some such $\mathcal{T}$ or $\mathrm{depth}^{\mathrm{da}}(\mathcal{T}) = \omega(\log n)$ for all such $\mathcal{T}$. Obtaining bounds for more general shapes would also be interesting.

2. What is the status of the following decision problems?

\[
\begin{aligned}
\mathrm{HierarchicalUniqueShape} &= \left\{ \langle S, \mathcal{T} \rangle \;\middle|\; \mathcal{T} \text{ is a hierarchical TAS that strictly self-assembles finite shape } S \right\} \\
\mathrm{HierarchicalDirectedUniqueShape} &= \left\{ \langle S, \mathcal{T} \rangle \;\middle|\; \mathcal{T} \text{ is a directed hierarchical TAS that strictly self-assembles finite shape } S \right\}
\end{aligned}
\]

In the case of the seeded aTAM, the seeded variants of these problems are known to be coNP-complete [6,20,30] and in P [5], respectively.
3. What is the status of the following decision problems?

\[
\begin{aligned}
\mathrm{HierarchicalMinTileSet} &= \left\{ \langle S, c \rangle \;\middle|\; (\exists \mathcal{T} = (T, \tau))\ \mathcal{T} \text{ is a hierarchical TAS with } |T| \leq c \text{ and } \mathcal{T} \text{ strictly self-assembles finite shape } S \right\} \\
\mathrm{HierarchicalDirectedMinTileSet} &= \left\{ \langle S, c \rangle \;\middle|\; (\exists \mathcal{T} = (T, \tau))\ \mathcal{T} \text{ is a directed hierarchical TAS with } |T| \leq c \text{ and } \mathcal{T} \text{ strictly self-assembles finite shape } S \right\}
\end{aligned}
\]

In the case of the seeded aTAM, the seeded variants of these problems are known to be $\Sigma_2^{\mathrm{P}}$-complete [10] and NP-complete [5], respectively.



4. What is the time complexity of strictly self-assembling an $n \times n$ square with a hierarchical TAS? Any shape with diameter $n$? What if we require the TAS to be directed?

5. Two asymptotically unrealistic aspects of the model are the assumption of a constant rate of diffusion of assemblies and a constant binding strength threshold required to bind two assemblies together. Large assemblies will diffuse more slowly in a well-mixed solution; some simple models predict that the diffusion rate of a molecule is inversely proportional to its diameter [9,36]. It is conceivable that an assembly model properly accounting for diffusion rates could enforce an absolute lower bound of $\Omega(D)$ on the assembly time required to assemble any shape of diameter $D$.

The binding strength threshold of the seeded aTAM is a simplified model of a more compli- 
cated approximation in the kinetic tile assembly model (kTAM, [49]). Tiles in reality will 
occasionally detach, but so long as their concentration is sufficient, another tile will reattach 
after not too much time. While our model accounts directly for concentrations of large as- 
semblies, it only accounts for this concentration up to the moment of first binding. A more 
realistic model might require a larger binding strength threshold to balance the fact that, 
if a large assembly detaches, it may take a long time to reattach. In particular, the seeded 
aTAM is justifiable as a model, despite its lack of reverse reactions or modeling of strength-1 
attachments (which happen in reality but have a higher reverse rate than higher strength 
attachments), in part due to Winfree's proof [49,50] that under suitable conditions (in par- 
ticular setting concentrations and binding energies such that the rate of forward attachments 
is just barely larger than the rate of backward detachments of strength-2-bound tiles), the 
kTAM "simulates the aTAM with high probability." It is an open question whether there is 
any similar theorem that can be proven in the hierarchical aTAM, showing that detachment 
reactions may be safely ignored under certain conditions. 

Incorporating these and other physical phenomena into the hierarchical assembly model would 
be an interesting challenge. 
A Appendix: Formal Definition of Abstract Tile Assembly Model 

This section gives a terse definition of the abstract Tile Assembly Model (aTAM, [49]). This is not 
a tutorial; for readers unfamiliar with the aTAM, [38] gives an excellent introduction to the model. 

Fix an alphabet $\Sigma$. $\Sigma^*$ is the set of finite strings over $\Sigma$. Given a discrete object $O$, 
$\langle O \rangle$ denotes a standard encoding of $O$ as an element of $\Sigma^*$. $\mathbb{Z}$, $\mathbb{Z}^+$, $\mathbb{N}$, $\mathbb{R}^+$ denote 
the set of integers, positive integers, nonnegative integers, and nonnegative real numbers, 
respectively. For a set $A$, $\mathcal{P}(A)$ denotes the power set of $A$. Given $A \subseteq \mathbb{Z}^2$, the full grid graph 
of $A$ is the undirected graph $G^{\mathrm{f}}_A = (V, E)$, where $V = A$, and for all $u, v \in V$, 
$\{u, v\} \in E \iff \|u - v\|_2 = 1$; i.e., if and only if $u$ and $v$ are adjacent on the integer 
Cartesian plane. A shape is a set $S \subseteq \mathbb{Z}^2$ such that $G^{\mathrm{f}}_S$ is connected. 
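
As an illustration of the definition of a shape, the following minimal Python sketch builds the edge set of the full grid graph of a finite point set and tests connectivity by breadth-first search. The function names and the representation of points as integer pairs are assumptions made for this example, as is the choice to reject the empty set.

```python
from collections import deque

# The four unit vectors; two lattice points are adjacent iff they differ by one of these.
UNIT_VECTORS = [(0, 1), (0, -1), (1, 0), (-1, 0)]

def full_grid_graph(points):
    """Return the edge set of the full grid graph of a finite set of lattice points."""
    points = set(points)
    edges = set()
    for (x, y) in points:
        for dx, dy in UNIT_VECTORS:
            q = (x + dx, y + dy)
            if q in points:
                edges.add(frozenset({(x, y), q}))
    return edges

def is_shape(points):
    """Return True if the full grid graph of `points` is connected."""
    points = set(points)
    if not points:
        return False  # assumption of this sketch: the empty set is not treated as a shape
    start = next(iter(points))
    seen = {start}
    queue = deque([start])
    while queue:
        x, y = queue.popleft()
        for dx, dy in UNIT_VECTORS:
            q = (x + dx, y + dy)
            if q in points and q not in seen:
                seen.add(q)
                queue.append(q)
    return seen == points
```

For example, an L-shaped set of three points is a shape, while two diagonally adjacent points are not, since diagonal neighbors are at Euclidean distance greater than 1.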
A tile type is a tuple $t \in (\Sigma^* \times \mathbb{N})^4$; i.e., a unit square with four sides listed in some 
standardized order, each side having a glue label (a.k.a. glue) $\ell \in \Sigma^*$ and a nonnegative integer 
strength. For a set of tile types $T$, let $\Lambda(T) \subseteq \Sigma^*$ denote the set of all glue labels of tile types 
in $T$. Let $\{N, S, E, W\}$ denote the directions consisting of unit vectors 
$\{(0, 1), (0, -1), (1, 0), (-1, 0)\}$. Given a tile type $t$ and a direction $d \in \{N, S, E, W\}$, 
$t(d) \in \Lambda(T)$ denotes the glue label on $t$ in direction $d$. We assume a finite set $T$ of tile types, 
but an infinite number of copies of each tile type, each copy referred to as a tile. An assembly is a 
nonempty connected arrangement of tiles on the integer lattice $\mathbb{Z}^2$, i.e., a partial function 
$\alpha : \mathbb{Z}^2 \dashrightarrow T$ such that $G^{\mathrm{f}}_{\mathrm{dom}\,\alpha}$ is connected and $\mathrm{dom}\,\alpha \neq \emptyset$. The shape of $\alpha$ is 
$\mathrm{dom}\,\alpha$. Write $|\alpha|$ to denote $|\mathrm{dom}\,\alpha|$. Given two assemblies $\alpha, \beta : \mathbb{Z}^2 \dashrightarrow T$, we say $\alpha$ 
is a subassembly of $\beta$, and we write $\alpha \sqsubseteq \beta$, if $\mathrm{dom}\,\alpha \subseteq \mathrm{dom}\,\beta$ and, for all points 
$p \in \mathrm{dom}\,\alpha$, $\alpha(p) = \beta(p)$. 

Given two assemblies $\alpha$ and $\beta$, we say $\alpha$ and $\beta$ are equivalent up to translation, written 
$\alpha \simeq \beta$, if there is a vector $\vec{x} \in \mathbb{Z}^2$ such that $\mathrm{dom}\,\alpha = \mathrm{dom}\,\beta + \vec{x}$ (where for $A \subseteq \mathbb{Z}^2$, 
$A + \vec{x}$ is defined to be $\{p + \vec{x} \mid p \in A\}$) and for all $p \in \mathrm{dom}\,\beta$, $\alpha(p + \vec{x}) = \beta(p)$. In this 
case we say that $\beta$ is a translation of $\alpha$. Given a shape $S \subseteq \mathbb{Z}^2$, we say that $S$ is canonical if 
$S \subseteq \mathbb{N}^2$, $(x, 0) \in S$ for some $x \in \mathbb{N}$, and $(0, y) \in S$ for some $y \in \mathbb{N}$. In other words, $S$ is 
located entirely in the first quadrant, but as far to the left and down as possible. We say an 
assembly $\alpha$ is canonical if $\mathrm{dom}\,\alpha$ is canonical. For each finite assembly $\alpha$, there is exactly one 
canonical assembly $\tilde{\alpha}$ such that $\alpha \simeq \tilde{\alpha}$. Given such a finite $\alpha$, we say $\tilde{\alpha}$ is the canonical 
assembly of $\alpha$. For brevity and clarity, we will tend to abuse notation and speak of assemblies 
equivalent up to translation as if they are the same object, often taking $\alpha$ to mean $\tilde{\alpha}$, 
particularly when discussing concentrations. We have fixed assemblies at certain positions on $\mathbb{Z}^2$ 
only for mathematical convenience in some contexts, but of course real assemblies float freely in 
solution and do not have a fixed position. 
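
The canonical assembly of a finite assembly can be computed by translating it so that its smallest x- and y-coordinates are both 0. The sketch below assumes, purely for illustration, that an assembly is stored as a Python dict from integer pairs to tile types; the function names are hypothetical.

```python
def canonicalize(assembly):
    """Translate a finite, nonempty assembly so that it lies in the first
    quadrant and touches both coordinate axes.

    `assembly` maps (x, y) positions to tile types (any comparable values).
    """
    min_x = min(x for x, _ in assembly)
    min_y = min(y for _, y in assembly)
    return {(x - min_x, y - min_y): t for (x, y), t in assembly.items()}

def equivalent_up_to_translation(a, b):
    """Two finite assemblies are equivalent up to translation exactly when
    their canonical assemblies coincide."""
    return canonicalize(a) == canonicalize(b)
```

Comparing canonical forms in this way is what allows assemblies that differ only by a translation to be treated as the same object.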

Let $\alpha$ be an assembly and let $p \in \mathrm{dom}\,\alpha$ and $d \in \{N, S, E, W\}$ such that $p + d \in \mathrm{dom}\,\alpha$. Let 
$t = \alpha(p)$ and $t' = \alpha(p + d)$. We say that the tiles $t$ and $t'$ at positions $p$ and $p + d$ interact if 
$t(d) = t'(-d)$ and $g(t(d)) > 0$, i.e., if the glue labels on their abutting sides are equal and have 
positive strength. Each assembly $\alpha$ induces a binding graph $G^{\mathrm{b}}_\alpha$, a grid graph 
$G^{\mathrm{b}}_\alpha = (V_\alpha, E_\alpha)$, where $V_\alpha = \mathrm{dom}\,\alpha$, and $\{p_1, p_2\} \in E_\alpha \iff \alpha(p_1)$ interacts with $\alpha(p_2)$.$^{25}$ 
Given $\tau \in \mathbb{Z}^+$, $\alpha$ is $\tau$-stable if every cut of $G^{\mathrm{b}}_\alpha$ has weight at least $\tau$, where the weight of an 
edge is the strength of the glue it represents. That is, $\alpha$ is $\tau$-stable if at least energy $\tau$ is 
required to separate $\alpha$ into two parts. When $\tau$ is clear from context, we say $\alpha$ is stable. 
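
To make the stability condition concrete, the following Python sketch computes the binding graph of a small assembly and checks stability by enumerating every cut. It is a brute-force illustration only (exponential in the number of tiles), not an efficient algorithm. The dict-based tile representation and the helper `tile` are assumptions of this example, and the strength of a glue is assumed to be determined by its label.

```python
from itertools import combinations

OFFSETS = {"N": (0, 1), "S": (0, -1), "E": (1, 0), "W": (-1, 0)}
OPPOSITE = {"N": "S", "S": "N", "E": "W", "W": "E"}

def tile(n=("", 0), s=("", 0), e=("", 0), w=("", 0)):
    """Hypothetical helper: a tile type as a dict from direction to (label, strength)."""
    return {"N": n, "S": s, "E": e, "W": w}

def binding_graph(assembly):
    """Return {frozenset({p, q}): strength} for every pair of interacting tiles."""
    edges = {}
    for p, t in assembly.items():
        for d in ("N", "E"):  # looking only north and east visits each adjacent pair once
            q = (p[0] + OFFSETS[d][0], p[1] + OFFSETS[d][1])
            if q not in assembly:
                continue
            label, strength = t[d]
            other_label, _ = assembly[q][OPPOSITE[d]]
            if label == other_label and strength > 0:
                edges[frozenset({p, q})] = strength
    return edges

def is_stable(assembly, tau):
    """Check that every cut of the binding graph has weight at least tau."""
    edges = binding_graph(assembly)
    points = sorted(assembly)
    first, rest = points[0], points[1:]
    # Keep `first` on one side so that each bipartition is examined exactly once;
    # sizes stop short of len(rest) so the other side is never empty.
    for size in range(len(rest)):
        for chosen in combinations(rest, size):
            side = {first, *chosen}
            weight = sum(s for e, s in edges.items() if len(e & side) == 1)
            if weight < tau:
                return False
    return True
```

An assembly consisting of a single tile has no cuts, so under this sketch it is stable at every temperature.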

A.1 Seeded aTAM 

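A seeded tile assembly system (seeded TAS) is a triple $\mathcal{T} = (T, \sigma, \tau)$, where $T$ is a finite set of 
tile types, $\sigma : \mathbb{Z}^2 \dashrightarrow T$ is the finite, $\tau$-stable seed assembly, and $\tau \in \mathbb{Z}^+$ is the temperature. 
Given two $\tau$-stable assemblies $\alpha, \beta : \mathbb{Z}^2 \dashrightarrow T$, we write $\alpha \to_1^{\mathcal{T}} \beta$ if $\alpha \sqsubseteq \beta$ and 
$|\mathrm{dom}\,\beta \setminus \mathrm{dom}\,\alpha| = 1$. In this case we say $\alpha$ $\mathcal{T}$-produces $\beta$ in one step.$^{26}$ If $\alpha \to_1^{\mathcal{T}} \beta$, 
$\mathrm{dom}\,\beta \setminus \mathrm{dom}\,\alpha = \{p\}$, and $t = \beta(p)$, we write $\beta = \alpha + (p \mapsto t)$. The $\mathcal{T}$-frontier of $\alpha$ is the 
set $\partial^{\mathcal{T}} \alpha = \bigcup_{\alpha \to_1^{\mathcal{T}} \beta} (\mathrm{dom}\,\beta \setminus \mathrm{dom}\,\alpha)$, the set of empty locations at which a tile could 
stably attach to $\alpha$. 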
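
The frontier can be computed directly for a finite assembly. The sketch below relies on the observation in footnote 26: when the existing assembly is already stable, a new tile can attach stably exactly when the glues it shares with its neighbors have total strength at least the temperature. The dict-based tile representation and all function names are assumptions of this example, and glue strength is assumed to be determined by the label.

```python
OFFSETS = {"N": (0, 1), "S": (0, -1), "E": (1, 0), "W": (-1, 0)}
OPPOSITE = {"N": "S", "S": "N", "E": "W", "W": "E"}

def tile(n=("", 0), s=("", 0), e=("", 0), w=("", 0)):
    """Hypothetical helper: a tile type as a dict from direction to (label, strength)."""
    return {"N": n, "S": s, "E": e, "W": w}

def attachment_strength(assembly, p, t):
    """Total strength with which tile type t would bind at empty position p."""
    total = 0
    for d, (dx, dy) in OFFSETS.items():
        q = (p[0] + dx, p[1] + dy)
        if q in assembly:
            label, strength = t[d]
            other_label, _ = assembly[q][OPPOSITE[d]]
            if label == other_label:
                total += strength
    return total

def frontier(assembly, tile_types, tau):
    """Empty positions adjacent to the assembly where some tile type binds
    with strength at least tau (assumes `assembly` itself is tau-stable)."""
    candidates = {
        (x + dx, y + dy)
        for (x, y) in assembly
        for dx, dy in OFFSETS.values()
        if (x + dx, y + dy) not in assembly
    }
    return {
        p for p in candidates
        if any(attachment_strength(assembly, p, t) >= tau for t in tile_types)
    }
```

An assembly is terminal in the seeded model precisely when this set is empty.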

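25 For $G^{\mathrm{f}}_{\mathrm{dom}\,\alpha} = (V_{\mathrm{dom}\,\alpha}, E_{\mathrm{dom}\,\alpha})$ and $G^{\mathrm{b}}_\alpha = (V_\alpha, E_\alpha)$, $G^{\mathrm{b}}_\alpha$ is a spanning subgraph of $G^{\mathrm{f}}_{\mathrm{dom}\,\alpha}$: 
$V_\alpha = V_{\mathrm{dom}\,\alpha}$ and $E_\alpha \subseteq E_{\mathrm{dom}\,\alpha}$. 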

26 Intuitively $\alpha \to_1^{\mathcal{T}} \beta$ means that $\alpha$ can grow into $\beta$ by the addition of a single tile; the fact that 
we require both $\alpha$ and $\beta$ to be $\tau$-stable implies in particular that the new tile is able to bind to $\alpha$ 
with strength at least $\tau$. It is easy to check that had we instead required only $\alpha$ to be $\tau$-stable, 
and required that the cut of $\beta$ separating $\alpha$ from the new tile has strength at least $\tau$, then this 
implies that $\beta$ is also $\tau$-stable. 
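A sequence of $k \in \mathbb{Z}^+ \cup \{\infty\}$ assemblies $\vec{\alpha} = (\alpha_0, \alpha_1, \ldots)$ is a $\mathcal{T}$-assembly sequence if, for all 
$1 \le i < k$, $\alpha_{i-1} \to_1^{\mathcal{T}} \alpha_i$. We write $\alpha \to^{\mathcal{T}} \beta$, and we say $\alpha$ $\mathcal{T}$-produces $\beta$ (in 0 or more steps) if 
there is a $\mathcal{T}$-assembly sequence $\vec{\alpha} = (\alpha_0, \alpha_1, \ldots)$ of length $k = |\mathrm{dom}\,\beta \setminus \mathrm{dom}\,\alpha| + 1$ such that 
1) $\alpha = \alpha_0$, 2) $\mathrm{dom}\,\beta = \bigcup_{0 \le i < k} \mathrm{dom}\,\alpha_i$, and 3) for all $0 \le i < k$, $\alpha_i \sqsubseteq \beta$. In this case, we say 
that $\beta$ is the result of $\vec{\alpha}$, written $\beta = \mathrm{res}(\vec{\alpha})$. If $k$ is finite then it is routine to verify that 
$\mathrm{res}(\vec{\alpha}) = \alpha_{k-1}$.$^{27}$ We say $\alpha$ is $\mathcal{T}$-producible if $\sigma \to^{\mathcal{T}} \alpha$, and we write $\mathcal{A}[\mathcal{T}]$ to denote the set of 
$\mathcal{T}$-producible canonical assemblies. The relation $\to^{\mathcal{T}}$ is a partial order on $\mathcal{A}[\mathcal{T}]$ [29, 37].$^{28}$ A 
$\mathcal{T}$-assembly sequence $\alpha_0, \alpha_1, \ldots$ is fair if, for all $i$ and all $p \in \partial^{\mathcal{T}} \alpha_i$, there exists $j$ such that 
$\alpha_j(p)$ is defined; i.e., no frontier location is "starved". 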

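An assembly $\alpha$ is $\mathcal{T}$-terminal if $\alpha$ is $\tau$-stable and $\partial^{\mathcal{T}} \alpha = \emptyset$. It is easy to check that an assembly 
sequence $\vec{\alpha}$ is fair if and only if $\mathrm{res}(\vec{\alpha})$ is terminal. We write $\mathcal{A}_\Box[\mathcal{T}] \subseteq \mathcal{A}[\mathcal{T}]$ to denote the set of 
$\mathcal{T}$-producible, $\mathcal{T}$-terminal canonical assemblies. 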

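A seeded TAS $\mathcal{T}$ is directed (a.k.a., deterministic, confluent) if the poset $(\mathcal{A}[\mathcal{T}], \to^{\mathcal{T}})$ is 
directed; i.e., if for each $\alpha, \beta \in \mathcal{A}[\mathcal{T}]$, there exists $\gamma \in \mathcal{A}[\mathcal{T}]$ such that $\alpha \to^{\mathcal{T}} \gamma$ and $\beta \to^{\mathcal{T}} \gamma$.$^{29}$ 
We say that a TAS $\mathcal{T}$ strictly self-assembles a shape $S \subseteq \mathbb{Z}^2$ if, for all $\alpha \in \mathcal{A}_\Box[\mathcal{T}]$, $\mathrm{dom}\,\alpha = S$; 
i.e., if every terminal assembly produced by $\mathcal{T}$ has shape $S$. If $\mathcal{T}$ strictly self-assembles some 
shape $S$, we say that $\mathcal{T}$ is strict. Note that the implication "$\mathcal{T}$ is directed $\implies$ $\mathcal{T}$ is strict" 
holds, but the converse does not hold. 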

A.2 Hierarchical aTAM 

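A hierarchical tile assembly system (hierarchical TAS) is a pair $\mathcal{T} = (T, \tau)$, where $T$ is a finite 
set of tile types, and $\tau \in \mathbb{Z}^+$ is the temperature. Let $\alpha, \beta : \mathbb{Z}^2 \dashrightarrow T$ be two (possibly 
non-canonical) assemblies. Say that $\alpha$ and $\beta$ are nonoverlapping if $\mathrm{dom}\,\alpha \cap \mathrm{dom}\,\beta = \emptyset$. If $\alpha$ and $\beta$ 
are nonoverlapping assemblies, define $\alpha \cup \beta$ to be the assembly $\gamma$ defined by $\gamma(p) = \alpha(p)$ for all 
$p \in \mathrm{dom}\,\alpha$, $\gamma(p) = \beta(p)$ for all $p \in \mathrm{dom}\,\beta$, and $\gamma(p)$ is undefined for all 
$p \in \mathbb{Z}^2 \setminus (\mathrm{dom}\,\alpha \cup \mathrm{dom}\,\beta)$. An assembly $\gamma$ is singular if $\gamma(p) = t$ for some $p \in \mathbb{Z}^2$ and some 
$t \in T$ and $\gamma(p')$ is undefined for all $p' \in \mathbb{Z}^2 \setminus \{p\}$. Given a hierarchical TAS $\mathcal{T} = (T, \tau)$, an 
assembly $\gamma$ is $\mathcal{T}$-producible if either 1) $\gamma$ is singular, or 2) there exist producible nonoverlapping 
assemblies $\alpha$ and $\beta$ such that $\gamma = \alpha \cup \beta$ and $\gamma$ is $\tau$-stable. In the latter case, write 
$\alpha + \beta \to \gamma$. An assembly $\alpha$ is $\mathcal{T}$-terminal if for every producible assembly $\beta$ such that $\alpha$ and $\beta$ are 
nonoverlapping, $\alpha \cup \beta$ is not $\tau$-stable.$^{30}$ Define $\mathcal{A}[\mathcal{T}]$ to be the set of all $\mathcal{T}$-producible canonical 
assemblies. Define $\mathcal{A}_\Box[\mathcal{T}] \subseteq \mathcal{A}[\mathcal{T}]$ to be the set of all $\mathcal{T}$-producible, $\mathcal{T}$-terminal canonical 
assemblies. A hierarchical TAS $\mathcal{T}$ is directed (a.k.a., deterministic, confluent) if $|\mathcal{A}_\Box[\mathcal{T}]| = 1$. We 
say that a TAS $\mathcal{T}$ strictly self-assembles a shape $S \subseteq \mathbb{Z}^2$ if, for all $\alpha \in \mathcal{A}_\Box[\mathcal{T}]$, $\mathrm{dom}\,\alpha = S$; 
i.e., if every terminal assembly produced by $\mathcal{T}$ has shape $S$. 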
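
A single hierarchical attachment step can be sketched in Python as follows: given two finite assemblies, try every translation of the second that keeps it within distance one of the first, discard overlapping placements, and keep the unions that are stable at the given temperature. This is a brute-force illustration for very small assemblies only. The dict-based tile representation and all function names are assumptions of this example, and glue strength is assumed to be determined by the label.

```python
from itertools import combinations

OFFSETS = {"N": (0, 1), "S": (0, -1), "E": (1, 0), "W": (-1, 0)}
OPPOSITE = {"N": "S", "S": "N", "E": "W", "W": "E"}

def tile(n=("", 0), s=("", 0), e=("", 0), w=("", 0)):
    """Hypothetical helper: a tile type as a dict from direction to (label, strength)."""
    return {"N": n, "S": s, "E": e, "W": w}

def cut_weight(assembly, side):
    """Total strength of matching glues between tiles in `side` and tiles outside it."""
    total = 0
    for p in side:
        for d, (dx, dy) in OFFSETS.items():
            q = (p[0] + dx, p[1] + dy)
            if q in assembly and q not in side:
                label, strength = assembly[p][d]
                if label == assembly[q][OPPOSITE[d]][0]:
                    total += strength
    return total

def is_stable(assembly, tau):
    """Brute force: every bipartition of the tiles must have cut weight >= tau."""
    points = sorted(assembly)
    first, rest = points[0], points[1:]
    for size in range(len(rest)):
        for chosen in combinations(rest, size):
            if cut_weight(assembly, {first, *chosen}) < tau:
                return False
    return True

def canonicalize(assembly):
    """Translate so the smallest x- and y-coordinates are both 0."""
    min_x = min(x for x, _ in assembly)
    min_y = min(y for _, y in assembly)
    return {(x - min_x, y - min_y): t for (x, y), t in assembly.items()}

def combine(alpha, beta, tau):
    """Canonical forms of all tau-stable unions of alpha with a
    nonoverlapping translation of beta (tau is assumed to be >= 1)."""
    axs, ays = [x for x, _ in alpha], [y for _, y in alpha]
    bxs, bys = [x for x, _ in beta], [y for _, y in beta]
    results = []
    # Only translations placing beta's bounding box within distance 1 of alpha's can bind.
    for dx in range(min(axs) - max(bxs) - 1, max(axs) - min(bxs) + 2):
        for dy in range(min(ays) - max(bys) - 1, max(ays) - min(bys) + 2):
            shifted = {(x + dx, y + dy): t for (x, y), t in beta.items()}
            if alpha.keys() & shifted.keys():
                continue  # overlapping placements are not allowed
            union = {**alpha, **shifted}
            if is_stable(union, tau):
                canonical = canonicalize(union)
                if canonical not in results:
                    results.append(canonical)
    return results
```

Because a disconnected union has a cut of weight 0, the stability check also rejects placements in which the two assemblies do not touch.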

27 If we had defined the relation $\to^{\mathcal{T}}$ based on only finite assembly sequences, then $\to^{\mathcal{T}}$ would be simply the 
reflexive, transitive closure $(\to_1^{\mathcal{T}})^*$ of $\to_1^{\mathcal{T}}$. But this would mean that no infinite assembly could be 
produced from a finite assembly, even though there is a well-defined, unique "limit assembly" of every 
infinite assembly sequence. 

28 In fact it is a partial order on the set of $\tau$-stable assemblies, including even those that are not 
$\mathcal{T}$-producible. 

29 The following two convenient characterizations of "directed" are routine to verify. $\mathcal{T}$ is directed if 
and only if $|\mathcal{A}_\Box[\mathcal{T}]| = 1$. $\mathcal{T}$ is not directed if and only if there exist $\alpha, \beta \in \mathcal{A}[\mathcal{T}]$ and 
$p \in \mathrm{dom}\,\alpha \cap \mathrm{dom}\,\beta$ such that $\alpha(p) \neq \beta(p)$. 

30 The restriction on overlap is a model of a chemical phenomenon known as steric hindrance [46, Section 5.11] 
or, particularly when employed as a design tool for intentional prevention of unwanted binding in 
synthesized molecules, steric protection [24-26]. 

Let $\mathcal{T}$ be a hierarchical TAS, and let $\hat{\alpha} \in \mathcal{A}[\mathcal{T}]$ be a $\mathcal{T}$-producible assembly. An assembly tree 
$\Upsilon$ of $\hat{\alpha}$ is a full binary tree with $|\hat{\alpha}|$ leaves, whose nodes are labeled by $\mathcal{T}$-producible assemblies, 
with $\hat{\alpha}$ labeling the root, singular assemblies labeling the leaves, and node $u$ labeled with $\gamma$ having 
children $u_1$ labeled with $\alpha$ and $u_2$ labeled with $\beta$, with the requirement that $\alpha + \beta \to \gamma$. That is, 
$\Upsilon$ represents one possible pathway through which $\hat{\alpha}$ could be produced from individual tile types 
in $\mathcal{T}$. Let $\Upsilon(\mathcal{T})$ denote the set of all assembly trees of $\mathcal{T}$. Say that an assembly tree is $\mathcal{T}$-terminal 
if its root is a $\mathcal{T}$-terminal assembly. Let $\Upsilon_\Box(\mathcal{T})$ denote the set of all $\mathcal{T}$-terminal assembly trees of 
$\mathcal{T}$. Note that even a directed hierarchical TAS can have multiple terminal assembly trees that all 
have the same root terminal assembly. 

When $\mathcal{T}$ is clear from context, we may omit $\mathcal{T}$ from the notation above and instead write 
$\to$, $\partial\alpha$, frontier, assembly sequence, produces, producible, and terminal. We also assume without 
loss of generality that every positive-strength glue occurring in some tile type in some direction 
also occurs in some tile type in the opposite direction, i.e., there are no "effectively null" 
positive-strength glues. 